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Nl LabVIEW. 

Limited Only by Your Imagination. 



Build and program robots 
with LEGO' MINDSTORIVIS* 
NXT using software powered 
by Nl LabVIEW 



Develop your human machine 
interface (HMD display 



Independently control 
multiple servo motors 




Communicate via 
multiple protocols 
including Bluetooth 



Graphically program 
concurrent, real-time 
applicatiQns 



Target 32-bit 
microprocessors 
and FPGAs 




Real-Time and Embedded 
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PRODUCT PLATFUHM 
LabVIEW ReaUnme Module 
LabVIEW^m Miodule 
LabVIEW 

Microprocessor SDK 

Nl CompactRIO Embedded 
Hardware Platform 



When the LEGO Group needed parallel programming 
and motor control tools intuitive enough for children, 
it selected graphical software powered by Nl LabVIEW. 
With LabVIEW graphical system design, domain experts 
can quicl<ly develop complex, embedded real-time 
systems with FPGAs, DSPs, and microprocessors. 
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Cover feature: 

Designing DSP-based motor 

control using fuzzy logic 

BY BYRON MILLER 

Use of variable-speed drive motors will require a shift from 
PID controllers to systems based on fuzzy logic algorithms. 





Lower the cost 
of intelligent 
power control 
with FPGAs 

BY WENDY LOCKHART 
Combining a programmable solution 
with an industry-standard processor 
core can save time, money, and real 
estate. 



Virtuolizing embedded Linux 

BY GERNOT HEISER 

The system virtual machine can be used to make Linux-based 
applications faster and more responsive and secure. Here's a 
primer to get you started. 
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Back to 
the future: 
Manchester 
encoding 

i; POBERT GUASTEn A 
When commercial options fail, 
try using Manchester encoding 
and other time-tested protocols in 
low-cost, low bit-rate serial com- 
munications. 
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Is multicore 
hype or reality 

BY JACK G. GANSSLE 
Multicore processors are 
here to stay but memory 
is a bottleneck. 



New Products 



A round-up of the latest 
hardware and software. 
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While your competitors waddle about with Linux, fly past them with the INTEGRITY® real-time 
operating system from Green Hills Software - the only gTOS designed specifically for 21 st century 
embedded applications. 



Maximum performance 

• True real-time performance with response 
times hundreds of times faster than Linux 

• The fastest compiled code - semiconductor 
manufacturers choose our compilers 3x 
more often than GNU for benchmarks 

The lowest manufacturing cost 

• A footprint as small as 3 KB of RAM - 

hundreds of times less than Linux 

• The most compact compiled code - 
Green Hills® compilers es%n reduce the 
size of Lfntix by 35% compared to iNU 

Total reliability & absolute sieaifity 

• Secure partitioning with guaranteed 
resource availability that thwarts denial-of- 
service attacks, hackers, andTrojan horses 

• Bulletproof reliability - proven in safety- 
critical and flight-critical avionics systems 
deployed today 



The fastest time-to-market 

• The only POSIX® 2003 certified RTOS 
(even fork and exec) - maximizing source 
code [wrtability 

• Instant bring-up on new hardware, with 
or without a BSP 

• Support from the original developers - for 
the fastest and most informed response 

And with the INTEGRITY RTOS, you get ail you 
expect from a modern operating system: 

✓ Royalty-free 

/ Memory protection 

/ Over 1 00 device drivers and BSPs 

/ IPv4, IPv6, IPsec, SSL and more 

/ Broad processor support 

/ Source code available 

/ A complete and tightly integrated 
development environment 




Choose the INTEGRITY RTOS and soar over your competition by getting your electronic products 
to market first at the lowest cost - with the utmost performance, reliability, and security. Call us 
today to learn how easy it is to get started. 

•SOFTWARE, I M C. • 

www.ghs.com/fly 

i^EMBEDDEDSYSTElVISi 

1,2&3April2008-Hall8, Stand B10 
Paris Expo - Porte de Versailles 



France: +33 (0)1 43 14 37 00 
Israel: +972 (0)9 9584060 
Sweden: +46 (0)46 211 33 70 



Germany: +49 (0)721 98 62 580 
Netherlands: +31 (0)33 4613363 
UK: +44 (0)1844 267950 



Copyright © 2008 Green Hills Software, Inc. Green Hills, the Green Hills logo, and INTEGRITY are registered trademarks of Green Hills Software. POSIX is a 
registered trademark and 1 003. 1 is a trademark of the Institute of Electrical and Electronic Engineers, Inc. All other trademarks are the property of their owners. 




^ollandse J 8@mac.com 
Editorial address: PO Box 32444, Lontion SBtB-'i 

Adrertismg Prodacdon Manager 
Lydm Gijsegcmi & Mag^FIeaadun 

Td +32 (0) 2 740 00 50 
email production@eetimes.be 

Circulation 
Luc Desimpel 
Tel +32 (0) 2 740 0055 
)kimail: lucJesimpd@eetimes.be 

Art Manager 
Jean-Paul Speliers 
Tel +32 (0)2 740 0052 
email: jean-pauLspeliers@eetimes, be 

Finance 

Rkardo Pinto Ferreira 
Tel +32 (0)2 740 0051 
email: ricaTdo.pintoferTeira@eeHmes.be 

Advertisement contacts 
see page 43 

Editorial Director 
Jean-Pierre Jloosting 
Td+32(0)2 74O0(^ 
emaib jean-piem.joosting^eetimes.be 

Publisher 

Andre Rousselot 
^Xe/ +32 (0)2 740 0053 
W^'email: aroussel@eetimies.be 

European Business Press 
144 Avenue Eugene Plasky 
^i030 Brussels - Belgium 
r Tet +32 (0)2 740 00 50 
Fax: +32 (0)2 740 00 59 
Email: info@eeUmei.bc 
www.embedded. com/europe 
"LVAT Registration: BE4t61.357.437 
IpM; Brussels 

f Number: 0461357437 



:h insights. 

ic creators of tedmolegyM^ 




Techinsights - a division of 
United Business Media, LLC 
Paul Miller 

President, Chief Executive Officer 



© 20O8 EB.P. SA. a Techinsights company 

• Embedded Systems Design Europe is published eight times 
a year. All rights reserved. No part of this publication may 

I be reproduced or transmitted in any form or by any means, 
ctronic or mechanical, including photocopying, recording or 
talion storage or retrieval system without the ejqtress 
ton consent of the publisher. The c ont en ts of Embedded 
isign Europe are sul^t to reproduction in infarma- 
jf and retrie\-al systems. - 

* Embedded Systems Design Eurvpe is available free to tjuali- 
fied readers and so others for an annual subscription of €150. 
Refunds an canceUed subscriptions wiU only be provided at the 
PfMsher's discretion. St^isaiption queries Ui: +32 2 740 0050 



Software security - 
a matter of life and deatli 



Last month in this column we were 
anticipating an interesting visit 
to Embedded World in Nurem- 
berg and the event turned ovct to meet 
all our expectations. The exhibition set 
new records with 675 exhibitors - up 
14 percent - and 17,341 trade visitors 
from all over the world - up 27 percent. 
Particularly impressive was the rise of 
56 percent in international attendance. 
Engineers thronged the exhibition aisles 
and conference centre to be informed 
and educated but it was a bonus to be 
entertained. 

David Kleidermacher, CTO of Green 
HiUs Software, did exactly that in his 
executive summit keynote which looked 
at how healthy software is, especially in 
safety critical systems. 

Kleidermacher felt the general pub- 
lic think bad software security is only 
the subject of futuristic adventures and 
used the Bruce Willis movie coOoquially 
known as Die Hard 4 but ofiicially titled 
Live Free or Die Hard as an example. 

In the film cyber terrorists were able 
to hack into Washington DCs traffic 
control system and turn all the lights 
green causing chaos. Kleidermacher was 
watching the movie with his wife who, 
taking advant^e of having an expert 
in the family, posed the question: Can 
this really happen? It turns out that the 
proposition was not that far fetched with 
a number of examples on record includ- 
ing a hack of the LA traffic light system 
and reports that the U.S. Accountability 
Office found that the nations air traffic 
control system was wide open to hacking 
while in a separate incident a PC crash 




Colin Holland is 
editor 0^ Embedded 
Systems Design 
Europe. He can be 
contacted at 
cholland@cmp. com. 



put 800 planes in jeopardy with pilots 
having to use cell phones to communi- 
cate with each other. 

In another scene in the film the good 
guys persuaded an operator in a remote 
call center to switch on the ignition of 
a car for whidi they didn't have the 
keys. That might seem far fetched said 
Kleidermacher but maybe not, as late last 
year General Motors aimoimced an im- 
proved version of its OnStar telematics 
system that could be used to shut your 
car's engine off remotely. "Be afraid, be 
very afraid," warned Kleidermacher. 

What about the power grid. In the 
film Willis watches from a hill at night 
as sections of the Washington DC land- 
scape go black in turn as the hackers turn 
the power off. Another exaggeration, no 
said Kleidermacher, in 2004 a software 
bug contributed to a blackout of north- 
ern eastern U.S. As a result of the outage. 
New York City reported 60 serious fires 
and 3,000 fire calls, 800 elevator rescues, 
and 80,000 calls to 911. 

As the movie comes to its climax a 
flight of F-35 jets are scrambled and one 
jet's command and control communi- 
cations system is hacked into and it is 
ordered to attack Willis destroying much 
of the freeway in the process. 

Not another far-fetched plot that has 
actually liappened? At last Kleidermacher 
gets his pay off The F-35 uses Green 
Hills' INTEGRITY operating system for 
its mission and display systems, its com- 
munication/navigation/identification 
systems and crypto engine. It is based 
on EAL6-I- certifieation- Windows 
and Linux are only EAL4 level - and 
this higher level is the security we need 
to stop our weapons being turned on 
ourselves, explained Kleidermacher. 
But this level could also be used selec- 
tively in commercial applications using 
multicores and h|fpemsors«imfaiDve 
security. 
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Distributors to increase embedded focus 



The two top electronic component 
distributors are both addressing the im- 
portance of the embedded systems sec- 
tor by launching specialist divisions. 

Arrow is to increase its European 
focus on integrated systein-level tiech- 
nologies with launch of Advanced Em- 
bedded Solutions (AES). 

AES is designed to provide a Euro- 
pean focus on platform-level technolo- 
gies, AES covers integrated solutions 
including displays, industrial computer 
products, solid state storage and power 
supplies, as well as wireless modules for 
machine-to-machine (M2M) commu- 
nication and operating systems. 

The new organization is headed by 
Chris McAneny as European embed- 
ded solutions group leader. McAneny 
is responsible for AES teams in Arrow 
organisations throughout Northern, 
Central and Southern Europe. 

"Pressure on in-house engineering 
resource is affecting the 'make versus 
buy' balance. Increasingly, customers 
want to focus design effort around core 



Notebook computer manufacturer 
Quanta Computer, Inc. (Taipei, 
Taiwan) has agreed with embedded 
computer vendor Kontron AG (Ech- 
ing, Germany) to take a minority 
stake in Kontron's manufacturing 
subsidiary Kontron Asia, Inc. With the 
move, Kontron hopes to benefit from 
Quanta's manufacturing expertise and 
improve its productivity. 

The agreement wiU see Quanta buy 
21 percent of Kontron Asia's shares. 
The closing of the transaction is ex- 
pected before end of March according 
to Kontron. The move effectively cre- 
ates a joint venture of both companies. 
Kontron Asia's largest asset is a manu- 
facturing site in Penang, Malaysia. 

Kontron will maintain the opera- 
tive leadership in the joint venture, 
while both companies hope to benefit 
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competencies while buying in solutions 
that will allow them to minimise time- 
to-market. AES brings together the 
technologies and resources that address 
customer demand for the integrated 
solutions, while Arrow's component 
teams continue to deliver the products 
and technical support for customers' 
own circuit designs," said McAneny. 

Among the technologies within the 
AES portfolio are LCDs, single board 
computerSi system-on-module (SOM) 
solutions, solid stage storage, embedded 
operating systems, AC/DC power sup- 
plies, DC/DC converters and Bluetooth, 
ZigBee, Wireless LAN and ISM radio 
modules. A host of related and comple- 
mentary technofe^i^ aisi services 
including wireless antennae, display 
controllers, configuration services and 
traiiung will also be handled diroug^ 
the AES channel. 

Avnet Electronics Marketing 
EMEA's new channel for embedded 
systems, networking and display prod- 
ucts, is called Avnet Embedded. 



equally from the move: Kontron aims 
at achieving the same conditions in ma- 
terial procurement as the much larger 
Taiwanese partner. In addition, Kon- 
tron plans to benefit from economies of 
scale through the partnership. Quanta 
is said to be the world's largest note- 
book computer manufacturing services 
provider, producing about 30 percent 
of the world's notebook motherboards. 
For Quanta, the move could translate 
in a diversification of its production 
capacities for industrial markets. 

■ Kontron is also in the process of buy- 
ing Thales Computers S.A., a company 
which expects sales of about €20 mil- 
lion in its present financial year. The 
purchase is subject to approval by the 
French go\'ernment with deal expected 
to be finalized in March. 



It will initially do business in Ger- 
many, Italy, Spain and Eastern Europe, 
but over time extend its activities to 
other major European markets. Avnet 
Embedded will be led by Alessandro 
Brazzoni, formerly manager of the em- 
bedded division at Avnet Memec, now 
director of Avnet embedded sales & 
operations. 

Avnet Embedded will represent a 
number of manufacturing partners, 
including embedded boards from AAE- 
ON, Advantech, aValue, Protech, TQ 
Components and VLA Technologies; 
networking products from Atop, Delta 
Design, Digi International, Lancom, 
Maxstream, Tibbo, and Volktek; dis- 
plays from Litemax, Promate (AUO), 
Sharp, Varitronix, and Xiamen. Not 
all of the listed manufacturers will be 
available in all countries of operation 
from the beginning. 

Patrick Zammit, president of Avnet 
Electronics Marketing EMEA, said 
"The market for embedded system 
products beyond the pure chip level 
is increasing rapidly. Many customers 
ask for solutions of higher integration, 
to reduce time to market and develop- 
ment costs, and therefore are starting to 
work with finished and semi-finished 
solutions. By providing embedded sys- 
tems products, we just naturally extend 
our offering to meet some of our cus- 
tomers needs." 

Brazzoni added, "The strength of 
the Avnet speedboats is the excellent 
customer relationship combined with 
technical expertise and the focus on a 
limited number of franchises. Avnet 
Embedded entertains a completely new 
and different line card with complex 
products that need separate technical 
support and, to a large extent, a differ- 
ent logistic handling. Offering board 
level solutions and displays through 
a new product group enables Avnet 
- through Avnet Embedded - to lever- 
age the excellent market penetration 
of Avnet EM-EMEA while creating a 
specific technical focus for embedded 
products." 
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Kontron, Quanta join forces 



For land-based or airborne applications, the military needs COTS embedded sub-systems that deliver the 
optimum combination of performance, reliability, cost and rapid time-to-market. And Aitech has delivered 
just that since 1983! 

We're still in front. . . Aitech got a big jump on everyone way back when. . .and we've kept well ahead of the pack in developing 
advanced embedded sub-systems based on leading edge commercial technologies adapted to harsh environment applications. 

We don't compete with you... A lot of board level companies claim to be systems integrators. We don't. We leave the 
systems integration to the companies that do it best: our customers.. .companies just like yours! 

We have what it takes. . . Our VME and CompactPCI SBCs, PMCs, memory, I/O, graphics and chassis are in a class by themselves. 
And they're backed by world class software and technical support, program and lifecycle management, testing and more. 

We've been there. . . Aitech sub-systems are in widespread use in the world's most demanding military/aerospace applications: 
from radar, missile and weapons control to mission-critical data voice sub-systems. . .from mission computers and display consoles 
to ECM, sight head stabilization and counter measure control. . .from power control and conditioning to advanced imaging systems. 
And that's just for starters. 

Don 't' tal<e our word for it... Compare Aitech's offerings and expertise to any company in the COTS embedded marketplace. 
Land or air Or for that matter, sea or space. Come to talk to us. 




Coverity raises $22m as 
European business booms 



Coverity Inc. has raised its first round 
of external fianding with a $22 million 
investment by Foundation Capital and 
Benchmark Capital. The company has 
seen five consecutive years of positive 
cash flow growth and also grew in Eu- 
rope by 300 percent in 2007. 

It will use this investment for acqui- 
sitions, internal product development, 
and fiirther expansion into international 
markets. The company says its Prevent 
static analysis solution is now used by 
hundreds of thousands of developers. 

The company's expanded product 
portfolio will allow Coverity to establish 
a new presence in targeted segments of 
the market for application Ufecyde man- 
tgememt t&ok, a sector v^Meb Gaclttar 



has predicted will grow in European to 
more than $2.9 billion in 2011. Steve 
Machernis, European managing direc- 
tor at Coverity said, "Coverity is hiring 
aggressively throughout Europe while 
cultivating our network of partners." 

Coverity has recendy augmented 
its board of directors by adding Tony 
Zingale, who was formerly president and 
CEO of Mercury Interactive; Aki Fu- 
jimura, who is currently chairman/CEO 
of D2S and formerly was CTO of Ca- 
dence Design Systems; Bruce Dunlevie 
of Benchmark Capital; Paul Holland of 
Foundation Capital; and Mike Schuh of 
Foundation Capital who was formerly 
CEO at Intrinsa, and vice president of 
Mes at Clarify and Cadence. 




C16x - STIO 
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universal debug engine for 16/32 Bit Microcontroller 
On Top Solutions for System Development 




Please contact u^^tjf rnpre informatii 

www.pls-mc.com 




Help is on hand for 
Europe^s industrial 
control developers 

EE Times Europe has laimched its 
first DesignLine - Industrial Control 
DesignLinc pAirope. 

The site offers product news, in- 
depth technical papers and how-to 
articles, with a European perspective. 

The website supports engineers 
during the detailed design phase of 
their project, when they need to delve 
deeply into the software and hard- 
ware details of their designs. During 
this phase, engineers need access to 
many different types of information, 
ranging from product specifications 
to how-to instructions and trouble- 
shooting hints. 

The information on the site is 
focused on this phase of the product 
development cycle as well as on the 
specific requirements of industrial 
electronics - and all this with an un- 
mistakably European flavor which it 
will develop over time. 

There is also an editor's blogs, 
commenting on latest trends and 
events in the industry. 
See: www.industrial-europe.com 

Milestones in 
embedded systems 

How far we've come? Jack Ganssle 
and the Embedded Systems Design 
staff have created a list of some of 
the important happenings in the 
history of embedded systems. It can 
be viewed at www.embedded.com/ 
timeline and we invite comments 
and su^estions. Are we missing any 
events or inventions that are perti- 
nent to embedded systems? Do you 
have any good photos you're willing 
to share? Send your suggestions and 
photos to srambo@techinsights.com 
or log on to the fotlon on flie web- 
site and post your comments. 
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lackfin is driving innovation 
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The Audi AS Coupe — Only Blackfin has the horsepower to 
drive multiple functions in the A5's innovative infotainment 
system. And when media formats and communications 
standards change, Blackfin's software flexibilily will make 
Deld upgrading easy. 



Taking car infotainment wliere it's never been before 

*rheAudi A5's advanced electroniGS create harmony t^eetVdriver^and 
car— and drive competitive advantage. Tlie Biackfin® Processor does too, 
with scalabiiity, connectivity, and unbeatabie performance (a trait shared 
with the SHARC® Processor, featured in the A5's optional Bang & Oiufsen 
surround-sound amplifier). Blackfin powers the digital audiobroadcastand 
MP3-compatible 6-CD changer in the Audi Symphony Radio. Blackfin also 
enables the Audi Music Interface, integrating portable media players like 
the iPod® for dashboard display and easy steering wheel control. When it 
comes to the Audi A5, Blackfin really Is everywhere. 



Audi 



Get started using Blaclcfin now: 
analog.com/automotive 

Tel: +44 (0)1932 358 530 Fax: +32 (0)11 300 635 



ANALOG 
DEVICES 



@2007 Analog Devices, Inc. Pnxhict and compaiv names listed are tiademaitcs or trade names of their 
respective owners. 



Microsoft is recruiting for 
embedded center in Aachen 



Microsoft Corp. has launched its first 
Microsoft Embedded Systems Develop- 
ment Centre (MESDC) which will be 
based in Aachen, Germany. 

The MESDC is part of Microsoft's 
effort to expand regional development 
centers in Denmark, France, Ireland, 
Serbia, the U.K. and other countries 
across Europe. 

Located within the European Mi- 
crosoft Iimovation Centre (EMIC) in 
Aachen which was set up in 2003, the 
company says the MESDC is a signifi- 
cant part of the $75 nullion global R&D 
investment that the Windows Embed- 
ded Business is making in Europe this 
fiscal year. 

Microsoft is recruiting embedded 
systems engineers for the MESDC and 
the aim is to bring up to 15 engineers 
on board by the end of 2008 to work 
closely with the U.S. Windows Embed- 
ded product development team based 
in Redmond, Washington. 

The MESDC will support global 
product R&D, development of Micro- 
soft's embedded operating systems, and 
accelerate collaboration between the 
U.S. -based Microsoft product groups 



Congatec (Deggendorf, Germany) and 
SECO (Arezzo, Italy) have jointly cre- 
ated the Qseven form factor for em- 
bedded computing using the latest low 
power chipsets such as Intel's Menlow 
platform. Targeting next generation 
embedded processors built using 45nm 
technology, Qseven format is a board 
measuring 70 x 70mm and a maximum 
power consumption of around 12W. 

Where heat dissipation is an is- 
sue, a thermal cooling interface has 
been defined to help transfer any heat 
generated to a cooling solution. It will 
also provide connectivity through in- 
dustry standard interfaces, including: 

MARCH 2008 | Embedded systems design 



and their counterparts in Europe. 

The MESDC will also support the 
needs of the active Windows Embedded 
customer and partner ecosystem in Eu- 
rope by engaging with select members 
of this group to showcase high-visibility 
embedded systems projects that accel- 
erate embedded development. 

"The MESDC will form an mtegral 
part of helping us recruit outstanding 
embedded systems engineers across Eu- 
rope to perform product R&D, and the 
MESDC located in the EMIC in Aachen 
provides a very conducive environment 
for these engineers to spearhead incu- 
bation projects and accelerate technol- 
ogy transfer and collaborative efforts 
within Microsoft," according to John 
Lefor, the director of the European Mi- 
crosoft Innovation Center in Aachen. 

Kevin Dallas, general manager of 
the Windows Embedded Business at 
Microsoft, added, "Due to the immense 
engineering talent pool and high con- 
centration of enterprise customers and 
key Windows Embedded partners in 
the region, we see tremendous growth 
opportunities for the Windows Embed- 
ded Business in Europe. 



4x PCI Express; 2x SATA; 6x USB 2.0; 
Ix lOOOBaseT Ethernet; 2x SDIO 8 bit; 
LVDS 2x 24 bit; DVO/SDVO (shared); 
video input port (\^P); high definition 
audio (HDA); PC bus; and low pin 
count bus. By leveraging the Mobile 
PCI Express Module connector format, 
Qseven offers three different connec- 
tor heights, from 4.3mm to 7.8mm. 
Through this configuration it is hoped 
the Qseven platform v«ll be as simple to 
integrate as a DIMM memory module. 

The companies plan to release 
a general specification by the end of 
April, with n full design guide available 
by May 2008. 



European designers 
to win cash for 
green designs 

Freescale Semiconductor used the 
Embedded World conference to open 
registration in Europe for its first 
worldwide design contest. Top regional 
winners will be invited to compete in 
the Grand FTP Design Challenge, com- 
peting against other first place teams to 
win a total of $61,000 in cash prizes. 

The Freescale Technology Forum 
(FTP) Design Challenge Europe is 
focused on the development of sustain- 
able applications to encourage engi- 
neers to test the boundaries of green 
embedded systems design. 

"Energy management and other 
green issues are becoming major con- 
siderations in virtually every purchase 
decision: consumer or corporate," said 
Steve Wainwright, vice president of 
sales and marketing & general manager, 
Freescale EMEA. 

Embedded systems engineers and 
students are challenged to create in- 
novative, green electronic designs using 
a select group of Freescale solutions. 
Once registered, participants receive a 
kit with hardware and software devel- 
opment tools to start on their designs, 
submitting final design papers by May 
23. The judging process will coincide 
with FTP Paris, which will be held Oc- 
tober 7-8, where top regional winners 
will be recognized and awarded cash 
prizes. 

Freescale is hosting Design Chal- 
lenges in the Americas, Japan, Europe, 
China and India. To be eligible to 
participate in the Europe Design Chal- 
lenge, participants must be 18 years 
or older and live in Europe including 
Israel. First place regional winners wiU 
receive $10000, in addition to being 
invited to participate in the Grand 
FTP Design Challenge. Second place 
winners receive $5000 and third place 
$2000. Additional information includ- 
ing entry requirements and the official 
ruJes are available at www.fteescak. 
com/ designchallenge. 
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Europe invests in Java for multicore systems 



The Open Group is working with a 
consortium of European real-time 
technology developers, industrial man- 
ufacturers and research organizations 
to develop a new framework for Java- 
based real-time applications on modern 
parallel processor systems. 

Supported by the European Com- 
mission, the Java Environment for 
Parallel Real-time Development (JEOP- 
ARD) project is investing over €3.3 
million on an advanced framework for 
real-time Java ruiming on multicore 
and parallel systems. 

This platform-independent frame- 
work will maintain the robust reliability 
essential for safety and mission critical 
applications, while using the additional 
processing power available from the 
latest parallel platforms. 

Led by The Open Group (Reading, 
England), the JEOPARD consortium 
includes four universities and research 
institutes: University of York (Eng- 
land), Vienna University of Technology 
(Austria), FZI (Research Center for 
Information Technologies at the Uni- 
versity of Karlsruhe, Germany) and the 
Technical University of Cluj-Napoca 
(Romania); three industrial manu- 
fecturers: EADS NV (Schiphol-Rijk, 
Netherlands), RadioLabs (Rome, Italy) 
and SkySoft (Lisbon, Portugal); and 



Curtiss-Wright Controls Embedded 
Computing has acquired Pentland Sys- 
tems Ltd. (Livingston, Scotland), for 
$2.3 million. Pentland was founded in 
1990 and its sales in 2008 are expected 
to be approxiinately $3.6 million. The 
deal should enhance Curtiss-Wright's 
sensor processing product portfolio 
adding RE/IF signal acquisition, analog, 
digital I/O and synchro/resolver prod- 
ucts for radar, sofhvare defined radio 
and signal intelligence applications. 

Pentland's products and technology 
will be organized under Curtiss-Wright 
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two embedded systems technology sup- 
pliers: Aicas (Karlsruhe, Germany) and 
Sysgo ( Klein- Winternheim, Germany). 

The strategic objective of the JEOP- 
ARD project is to provide the tools for 
platform-independent development 
of predictable systems that make use 
of symmetric multiprocessing (SMP) 
multicore platforms. These tools will 
enhance software productivity and 
reusability by extending processor tech- 
nology already established on desktop 
systems for the specific needs of tnidti- 
core embedded systems. 

The project will actively contribute 
to standards required for the develop- 
ment of portable software in this do- 
main, such as the Real-time Specifica- 
tion for Java (RTSJ). 

In addition, the JEOPARD project 
win develop a platform-independent 
software development interface for re- 
altime multicore systems. The interface 
will be based on existing technologies, 
including the Real-Time Specification 
for Java (JSR 1 and JSR 282) and Safety- 
Critical Java (JSR 302). 

These technologies currently pro- 
vide a strong foundation for the devel- 
opment of complex and highly reliable 
real-time systems, but they do not yet 
provide support for multicore systems. 
Even more challenging, some of the 



Controls Embedded Computing's 
Modular Solutions group, and will be 
managed by Rob Hoyecki, director of 
the digital signal processing division. 

"Pentland is a leading supplier of 
rugged high performance analog ac- 
quisition and reconstruction products 
for sonar, radar, signal intelligence 
and software defined radio apphca- 
tions in the aerospace and defense 
market," said Tom Quinly, president 
of Curtiss-Wright Controls Embedded 
Computing. "The addition of another 
incredibly talented and complemen- 



technologies can not address more 
than one processor at a time, making it 
impossible to develop applications that 
scale with the number of processors 
available on current and future multi- 
core systems. 

"The JEOPARD consortium 
partners are experts in all layers of a 
multicore embedded system, from the 
processor architecture to the Java vir- 
tual machine and the C and Java API 
layers," said Scott Hansen, director of 
European Projects at The Open Group. 
"We're confident the project wUl solve 
the critical issues in each of these layers, 
thereby providing a powerful interface 
for the appUcations developer at the 
highest level." 

"Embedded systems developers re- 
quire a stable platform based on recog- 
nized standards if they are to invest in 
new applications that fully exploit mul- 
ticore systems," said Fridtjof Siebert, 
CTO of Aicas and technical director for 
the consortium. "The JEOPARD proj- 
ect win provide an evolutionary path 
that maintains the advantages of Java 
such as portability, interoperabiUty, 
object orientated design, while enabUng 
real-time systems developers to achieve 
higher performance levels and scal- 
abiUty with increasingly sophisticated 
multicore platforms." 



tary technical team fits our strategy of 
continuing to expand our portfolio to 
offer the broadest and most tightiy in- 
tegrated rugged deployed COTS boards 
and subsystems available." 

Hoyecki added, "Their approach, 
which tightly couples the analog front- 
end with FPGA technolog)' is highly 
desired by our customers who require 
a defined signal bandwidth to be deliv- 
ered at the application level." The ac- 
quisition strengthens Curtiss-Wright's 
presence in Europe, adding to their 
existing video and integration business 
unit in Letchworth, which became part 
of the group in 2004 vrfth the purchase 
of Primagraphics for $21million. 
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BY BYRON MILLER 



DSP 



-based 



motor control using 

fuzz y log ic 



The increased use of variable-speed drive motors to reduce 
energy consumption will require a shift from PID controllers 
to systems based on fuzzy logic algorithms to simplify design, 
reduce development time, and eliminate complex math formulas. 



Variable-speed drive (VSD) motors 
provide hope for greatly reducing 
energy consumption and reliance on 
foreign fuels. In one approach, digilal 
signal processors (DSPs) are being 
used to create a new generation of VSD-based control- 
lers for motors such as brushless direct current (BLDC) 
motors. 

However, these motors present challenges. Control- 
ling motor speed on a BLDC motor is complcated when 
using traditional proportional, integral, and differential 
(PID) controllers because they rely on a complex math- 
ematical model and are computationally inteasive. An 
alternative approach is to use fuzzy logic (FL) algorithms 
to eliminate the need for complex math formulas and 
provide an easy-to-understand solution. FL motor con- 
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trol also has a shorter development 
q^cle compared to PID controllers, 
and thus a faster time-to-market. This 
article discusses the process of using 
FL algorithms to control BLDC motors 
using a Texas Instruments c28xx; fixed- 
point family of DSPs. 

BLDC CONTROL MODEL 
DEVELOPMENT 

Before constructing the PL engine, we 
must first develop a model to base the 
design on. FL controllers use heuris- 
tic knowledge and express the design 
using a linguistic description of the 
model. Rather than develop a model 
from scratch, we'll use the PID con- 
troller model as a starting point. Once 
developed and implemented, the FL 
controller is improved by adjusting its 
parameters. 

In general, there are three design 
steps for developing a FL BLDC con- 
troller: 

1. Define inputs, outputs, and the 
controller's range of operation. 

2. Define fiizzy membership set 
functions and rules. 

3. Tune the engine. 

Figure 1 shows the block digram 
of the BLDC controller model. 

The first step is to define the rel- 
evant inputs and outputs of the model. 
The inputs are the error (E), which is 
the current error between the set speed 
(SS) and the current speed (CS). The 
other input is the change in error (CE), 
which is the diflference between the cur- 



rent error, and the previously calculat- 
ed error (PE). The output is the change 
in armature voltage (CV), which is 
the difference between the current ar- 
mature voltage (CAV) and the stored 
value of the previous armature voltage 
(PAV). The resulting model equations 
are as follows: 

E--SS-CS 
CE = E-m 
CV=CAV-PAV 

Motor speed units are in revolu- 
tions per minute (RPM), and E deter- 
mines how close we are to the target 
speed. So for E>0 motor speed is 
below set speed. Alternatively, £ < in- 
dicates that the motor is spinning faster 
than the set speed. CE determines con- 
troller direction to adjust. CE is positive 
if and only if (iff) the current speed is 
less than the set speed. 

Alternatively, CE is negative iff the 
current speed is greater than the set 
speed. When doseio the set speed CE 
alternates between positive and nega- 
tive values. CV is the energizing voltage 
applied to the armature. This volt^e is 
expressed in implementation as a pulse 
width modulation (PWM) duty cycle. 

The next step is to define the fuzzy 
membership set functions, variables 
and rules. In order to work, non-fuzzy 
(crisp) inputs and outputs must be 
converted into fuzzy ones. Conver- 
sion is performed by using linguistic 
variables to represent input and output 
ranges. These are also referred to as 
fuzzy variables. Fuzzy variables are used 



to partition a region of values for mem- 
bership functions. For example, five 
variables are used to map the inputs 
and output. They are negative medium 
(NM), negative small (NS), zero (Z), 
positive small (PS), and positive me- 
dium (PM). The model's inputs and 
outputs are membership set functions 
that are described over the range of op- 
eration by the five fiizzy variables. 

Instead of math formulas, an FL 
controller uses fuzzy rules to make a 
decision and generate an output; how 
cool is that? FL rules are in the form of 
IF-THEN statements. 

Fuzzy rules determine system 
behavior, rather than complex math 
equations. For example, IF the error 
(E) is equal to NM and change in error 
(CE) is equal to PS, then the change in 
the armature voltage (CV) is equal to 
NS. 

The number of rules used is based 

on the experience of the designer and 
the knowledge of the system. Thus, for 
our system the number of rules used 
is 25, which is based on our basic PID 
controller model using the PID's con- 
trol surface. 

In order to energize the armature, 
the CV fiizzy output must be converted 
back to a crisp output. This process is 
called defiizzification. A popular meth- 
od of defuzzification called the center of 
gravity method is used; I'll discuss it in 
greater detail later. 

The last step of design is to adjust 
the membership functions and rules. 
This stage is also referred to as tuning. 
Tuning is used to improve the per- 
formance of the FL controller. Once 
designed, the controller is ready to be 
implemented. 

The FL controller implementation 
is made up of three modules. They are 
fiizzification, rule-base, and defuzzifi- 
cation. The following sections discuss 
the modules as related to the FL-BLDC 
implementation. 

FUZZIFICATION 

Fuzzificaton is the process of converting 
crisp TOlue data into fiizzy data. The 
resulting fuzzy data conversion is based 
on the degree of fuzzy set membership 
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The membership funcHon for fhe inpuf variables Error, and Cerror. 



OxIAAA' 















MM 






> 




\" . 





-0x5000 -0x3554 -OxIAAA 
Figure 2 



OxIAAA 0(3554 0(5000 



The fuzzified membership funcNon for fhe input- variable Error: XI []. 
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The fuzzified membership funcfion for fhe inpu^ variable Cerron X2[]. 
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The inference rule fable. 
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Table 2 

of input variables. For this applica- 
tion, motor control input variables are 
rotational error (Error) and change in 
rotational error (Cerror), which are 
taken from the PID controller model 
discussed ^lier. Error is the absolute 
error from one sample time to the next. 
Similarly, Cerror is the change in Error 



from one sample time to the next The 
formulas for each are: 

Error = SetSpeed - CurrentSpeed 
Cerror = Error - PreviousError 

As mentioned during the design 
section, five membership sets are de- 
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Fuzzified dais resulfing from Error 
equal fa 0x30 (x-axis) and Cerror 

equal lo 0x1 (x-axis). 
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Table 1 

fined for variables Error and Cerror: 

1. NM: Negative Medium 

2. NS: Negative Small 

3. ZE: Zero 

4. PS: Positive Small 

5. PM: Positive Medium 

Figure 2 shows the membership 
sets for variables Error and Cerror. The 
membership sets are triangular-shaped 
and overlap to provide good response. 
Each set has a maximum value of 
OxIAAA. 

This differs from typical fuzzy logic 
Uterature, which sets the maximum 
range equal to one. Using a maximum 
value of OxIAAA for the range reduces 
computational complexity. Specifically, 
the multiplying operation is reduced to 
either a series of additions or subtrac- 
tions rather than converting to and 
firom a floating point number. 

The resulting fuzzification of the 
input variables produces a vector with 
five components that correspond to 
the fuzzy membership sets NM, NS, 
ZE, PS, PM. The value (y-axis) of each 
component represents the degree of 
membership for that crisp input value. 
The vectors containing the fuzzified 
values of Error and Cerror are denoted 
by arrays XI [] and X2[] respectively. 
For example, with Error equal to 0x30 
(x-axis), and Cerror equal to 0x10 (x- 
axis), the resulting fiizzified data are 
shown in Table 1. 

Figures 3 and 4 graphically show 
the resulting fiizzified -^ues for Error 
and Cerror. 

FUZZY INFERENCE RULES 

Fuzzy inference rules operate on the 
fuzzified data to determine the system's 
behavior. Specifically, the fiizzified data 
is applied against the rule table. Lin- 
guistically, this is where the input data 
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LisHng 1 Inference rules pseudo code. 



MinValue = MaxValue = 
Y[O..N] = 0; 
For (1 = to N) 
{ 

for Cj = to M) 
{ 

MinValue = Min{Xl[I], X2[M]}; 

// compare each XI element to X2 element and store 
// smaller value 

// store max value found among X2 members 
If CMax{Xl[I], X2[M]} > MaxValue) // 
{ 

MaxValue = Max{Xl[I], X2[M]}; 
} 

// add max value to output vector function defined by 

// membership rules 

MaxValue += output_vector_member(x) ; 

} 

} 



The Dufpuf membership funcNon. 
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Figure 6 

Error and Cerror are compared with 
the rule table. The rule table contains 
membership set components NM, NS, 
ZE, PS, and PM, depending on the con- 
trol surface. The output is "inferred" 
from the valid or "fired" rules. The in- 
ference process is described by pseudo 
code in Listing 1. 

Table 2 shows the initial rule base 
for the motor control surface. 



The oufpul" membership funcHon mulKplied by vecfor YI). 



• 0x1 96A 



In the earlier example, output vec- 
tor Y[] is shown in Table 3. 

DEFUZZIFICATION 

Defuzzification is the process of con- 
verting fuzz\' data back into crisp value 
data. For the purpose of this applica- 
tion the defuzzified value determines 

the duty cycle of the PWM signal used 
to drive the motor. The duty cycle is 



determined hv using the modified cen- 
troid calculation function. The defuzzi- 
fication approach used here takes the 
centroid function and multiplies it by a 
coefficient. The modified calculation is 
also known as the centroid point calcu- 
lation function. This approach provides 
additional precision over the centroid 
calculation function. 

The centroid point calculation is 
obtained by the center point of the 
function that is the result of the mul- 
tiplication of the output membership 
function by the output vector Y[] . The 
formula for the centroid point calcula- 
tion is: 

Defiizzi&ed Value = 

X_Yli] X multCoefF[J] / 2 y[i] 

where Y[i] are the i-th elements of 
the output vector, and multCoeff[!] are 
the multiplying coefficients of the out- 
put membership function. The index i 
has a range of / = 1 to i = 5. 

Figure 5 shows the graphical rep- 
resentation of the output membership 
function used by this application with 
the coefficients of [-0x10, -0x8, 0x0, 
0x8,0x10]. 

Using the example output vector 
Y[] = [0x0, 0xl3F, 0xl96A, 0x0, 0x0], 
the following defuzzification output 
value is calculated: 

Defuz = X (-16) + 319 X (-8) 
+ 6506 X (0) -I- X (8) + X 
(16) / + 319 -h 6506 + -h 
= -2552 / 6825 => -0.37391 

Figure 6 shows the graphical repre- 
sentation of the centroid point calcula- 
tion for the output vector Y[] . 

HARDWARE DESCRIPTION 

The eZdsp2812 board is used in this 
motor control application. At the heart 
of the eZdsp board is the TMS320F2812 
DSP. The TMS320F2812 is a 150-MHz 
device that uses timer Tl running at 
20 kHz for generating PWMl-6 signals, 
and timer T2 running at 40 kHz for 
executing interrupt service routines 
(ISRs). Additionally, the input capture 
pins CAP 1-3 are used to collect speed 
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Demonstration application motor control block diagram. 



BLDC 
motor 




Figure 7 



Motor debug session. 




Figure 8 

data from the hall-effect sensors. 

Other members of the 28xx 
family may be substituted for the 
TMS320F2812. For instance, the 
eZdsp2808 board may be used if the 
timers driving the PWM and ISRs are 
changed. Specifically on the eZdsp2808 
board, EPWMl-3 is used for PWM 
generation, while CPU timer is used 
for an ISR interrupt source. Similarly, 
ECAPl-3 capture signals from the hall- 
effect sensors. 

The motor is driven with PWM 
signals generated by the DSP and trans- 
lated to a 3-phase output. The six PWM 
signals are used to source the 3-phase 
power inverter. The power inverter 



converts the six sigods-to a 3-phase 

signal that directly powers the motor. 
The 3-phase power inverter function is 
handled by an auxiliary motor control 
board. Spectrum Digital provides two 
boards that provide this function: the 
DMC550 and the DMC1500. Either 
board plugs directly into the eZdsp28xx 
board. 

Hall-effect sensors are used for 
feedback for the fuzzy logic control 
loop. The commutation instants for 
the 3-phase power inverter switches 
are determined by detecting edges 
from signals received from the hall-ef- 
fect sensors. The signals are fed into 
the TMS320F2812's capture pins and 



are debounced to eliminate noise or 
false edges from motor oscUlations. 
The actual motor speed is calculated 
by counting the edge-triggered sig- 
nals from the Hall-effect sensors via a 
software module. Figure 7 shows the 
hardware block diagram for controlling 
a three phase BLDC motor. 

SOFTWARE DESCRIPTION 

The motor control software is com- 
posed of DMC Library modules and 

the FL motor control routines. Seven 
of the DMClib modules are used in this 
application. They are: 

Datalog 
BLDC3PWM 

Hall3_Drv 
Mod6_Cnt 
Rmp2Cntl 

Ramp_Cntl 
Speed_PR 

Additionally, the FL motor con- 
trol is handled by a main FuzzyCtl () 
routine; this is FuzzyBLDCO for BLDC 
motors. When configured, these com- 
ponents demonstrate fuzzy logic con- 
trol of a variable speed motor. 

The software works by first per- 
forming configuration, then applica- 
tion-specific setup. Specifically, the 
GPIO pins are configured to act as 
PWMs and CAPture pins. Next the 
timers and module parameters are 
initialized, as well as ISR setup. After 
all peripherals are setup, interrupts are 
enabled, and the main control loop is 
entered. The main control loop calls 
the fuzzy controller once every 8.7 ms. 

The error values are converted 
through fuzzification into fuzzy values 
and stored in Xl[], and X2[J. Once 
converted, the fiizzified values applied 
to the fuzzy inference rules. The results 
from the inference module are stored 
in Y[]. Output from Y[] is then ap- 
plied to the defuzzification module to 
convert the fuzzy value back to a crisp 
value. The resulting crisp value is a 
PWM offset that is added to the current 
PWM duty cycle; creating a closed loop 
system. The updated PWM value is 
checked to see if the new value is wilitin 
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FUZZY LOGIC DEFINITIONS 

Centroid calculation function: used for producing an exact output 
value by calculating the center of gravity of the union of areas bound 
by membership functions and the input variable axes. 

Defuzzification: a general method for determining the best exact or 
"crisp" output of a given fuzzy set, defuzzification uses the centroid 
calculation function or a similar function to generate a crisp output. 

Fuzzification: a method for converting a "crisp" input value to a 
fuzzy membership function. The resulting fuzzy value is o member of a 

multivalued set. 

Fuzzy control system: a control system based on fuzzy IF-THEN 

rules that use fuzzy sets for input and output. 

Fuzzy Inference system: a collection of fuzzy IF-THIEN rules. 

Fuzzy logic: logic that uses linguistic variables to describe a system. 
Examples include: "fast," "slow," and "medium." 

Fuzzy set: any set that allov^s its members to hove different "grades" 

of membership. Eoch member may be expressed by a continuous num- 
ber betv/een zero and one: [0,1]. This contrasts to Boolean logic that 
limits set members to a value of either zero or one. 

Fuzzy system: a system v/hose variables range over states that are 
fuzzy sets. 

Membership function: the mapping of a fuzzy set that associates 
each set member with its grade of membership. 



bounds, and appropriate action is taken 
if it is not. Finally the fuzzy controller 
returns the updated PWM duty cycle 
count to the calling routine. 

Figure 8 shows a debug session of 
the demo application. Channels 1 &2 
(the display window in the upper-right 
hand corner) displays the PWM coun- 
ter, and the capture of the hall-effect 
sensors respectively. 

Channels 3 & 4 (the display win- 
dow beneath channels 1 & 2) show the 
edge-triggered versions of the PWM 
counter and the hall-effect sensors. 
The watch window displays important 
variables. Most important are SetSpeed, 
and Curr^tSpe^. These values m-e 
close enough so that the output of the 



fuzzy controller has a zero value. Also 
note the other values used in the con- 
troller process. This session shows the 
motor under no-load conditions. The 
behavior may be slightly different with 
a load. Moreover, if a finer granularity 
is desired it may be necessary to tune 
the controller. 

Byron Miller is an independent firmware 
engineer specializing in the design of mi- 
croprocessors, DSPs, hardware debug, 
porting, as well as the development of 
firmware for control, data acquisition, 
fuzzy logic, and Internet appliances. 
He has a BA in computer science and 
a masters in sofiweirie engineering. You 
can reach him at bmiller2@isd.net. 
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Combining a programmable solution with on industry-standard 
processor core can save time, money, and reol estate. 

Lower the COS! of 
intelligent power 

control with 



BY WENDY LDCKHART 



EPGAs 



The availability of ARM's 
Cortex-Ml 32-bit processor 
and single-cliip, mixed-signal 
FPGAs make possible the 
development of intelligent 
power control that could dramatically re- 
duce part count, board space, and system 
cost while increasing reliability, flexibility, 
and system availability. Methods exist for 
rapid prototyping and implementation of 
the hardware and software for server-based 
intelligent power; these BSeiiittds can also 
be applied to a range of energy-manage- 
ment applications. 

System considerations include de- 
velopment resources; tools available for 
development with Cortex-Ml and mixed- 
signal programmable system chips (PSCs); 
and availability of power control boards. 
Lowering the cost of intelligent power con- 
trol requires an tmderstanding of FFQi. 
implementation strategies, as well as a basic 
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understanding of designing with ARM's 
Cortex-Ml microprocessor and FPGA 
implementation tools. 

As the number of functions in a 
system grows, power to the system is 
no longer an afterthought. With time- 
to-market pressures and the need to 
support more features, designers must 
select peripherals from a standard set of 
components to meet their power and 
cost budgets. As a result, this may mean 
mixing an LCD with a 2.5-V supply 
and a ke)'pad with a 1.8-V supply. With 
many devices, the core and I/O volt- 
age within the device will also vary. So, 
within a single product, the power sup- 
ply may need to generate multiple volt- 
ages and possibly different sequences of 
the same voltage. 

In portable applications, the restric- 
tions can be even greater, requiring 
power management to extend battery 
life. Designs can therefore become quite 
complex, featuring multiple supplies, 
controlled ramp rates, power sequenc- 
ing, and complex supply management 
where suppUes are turned on and off as 
needed. 

Intelligent power control has ex- 



isted in some high-end systems as a 
custom implementation or more re- 
cently with standards, such as ATCA 
and MicroTCA. With the market now 
demanding smaller, portable versions 
of many applications, the power con- 
trol must be scaled down as well, creat- 
ing a need for intelligent power control 
that's low cost, small form factor, and 
low power. 

InteUigent power control involves 
the following basic aspects: 

• Generating aU the required system 
voltages. 

• Sequencing each device's power 
up and down to maintain system 
integrity and prevent issues such 
as latch-up, inrush current, or I/O 
contention. 

• Suppartii^ lh« ability to switch off 
certain devices when not needed 
and power them back up for seam- 
less operation. 

• Maintaining minimum function- 
ality in standby, with the ability 
to wake at certain intervals or on- 
demand. 

Implementing these functions in an 



appUcation-spedfic standard product 
(ASSP) would require a standard power 
profile. The use of a programmable 

device, however, allows for adaptation 
to multiple system requirements. Pro- 
grammable devices also allow for fiiture 
system upgrades and fine tuning. A 
number of programmable power mod- 
ules are available, but they still need to 
be combined with some form of brain 
to tell them when to turn each supply 
on and off. 

As shovm in Figure 1, mixed-sig- 
nal PSCs allow for the combination of 
programmable power generation with 
real- time analysis and control. Com- 
bining functions for power sequencing, 
monitoring, and control in a single chip 
results in significant cost savings in 
terms of materials list, board space, and 
buUd costs. In addition, a single device 
consumes less power and can manage 
the power supphes for the rest of the 
system. And, if the PSC can perform 
wake up from a watchdog timer or ex- 
ternal trigger, the entire system can be 
shut down except for one device. Some 
PSCs can also manage smart battery 
charging, which is another intelhgent 
way to enhance power control in the 
system. 

Early adoption of PSCs in intel- 
ligent power control apphcations has 
been facilitated by the use of reference 
designs, some in the area of ATCA and 
others in the associated intellectual 
property (IP), such as Intelligent Plat- 
form Management Interface (IPMI). 
This platform-based approach lets de- 
signers start from a known s)'stem and 
customize it, or use a fixed standard 
and work it into their system. Some 
system management development kits 
also provide examples for a more cus- 
tom approach to power control. This 
still requires a clear understanding of 
the PSC as well as a custom develop- 
ment environment. 

Because the PSCs core includes 
standard flash-based FPGA gates, the 
Cortex-Ml core can be used as part of 
the design. This provides significant 
benefits to an existing ARM designer 
who must integrate intelligent power 
control without having to learn a 
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full suite of analog tools. Cortex-Ml 
is fully backwards compatible with 
arm's Thumb architecture, so design- 
ers can port existing application code 
to the PSC. 

Although implementing a Cortex- 
Ml design on a PSC might be a new 
challenge to some engineers, certain 
tools and techniques can make this a 
relatively painless design cycle. Often 
the easy approach from a time-to- 
market point of view is the expensive 
approach from a cost point of view. But 
when the design tools are created by 
a silicon supplier, the motivation is to 
help the designer get to silicon as fast as 
possible. 

mPLEMENTING INTELUGENT 

POWER CONTROL 

Standard FPGA design can take several 

different forms: 

• Full HDL code or schematic 



starting from scratch. 

• Combination of existing versions 
of design plus some new code 
and features. 

• Combination of purchased or 
custom IP, existing design, and 
new code. 

The same styles apply to both 
mixed-signal FPGAs and soft ARM 
processors on FPGAs. 

The hardware and software imple- 
mentation steps are shown in Figure 
2. To simplify terms, the hardware 
implementation involves designing the 
processor and programming it to the 
FPGA. The software implementation 
involves developing code for the pro- 
cessor OH the FPGA to complete the 
application. 

First, the designer identifies avail- 
able industry-standard processors 
(8051 or ARM, for example) for the 
design and then looks through offer- 



ings from each supplier to find the 
best hst of peripherals for the design. 
If the perfect list of peripherals can't 
be found, then a CPLD or compan- 
ion FPGA can be added to the board. 
When implementing a soft processor 
on an FPGA, the supplier should pro- 
vide a list of available peripherals. 

Ideally, the user selects the proces- 
sor from a list. Then, the bus type, 
which is often dictated by the proces- 
sor, is chosen. Once the bus is selected, 
the user simply drags and drops pe- 
ripherals into the design. Peripherals 
may include interfaces such as serial 
peripheral interface, universal serial 
bus, timers, or UARTs. Unlike an ASSP 
processor, users can add any combina- 
tion of peripherals until they run out 
of peripheral ports. The tool should 
provide the ability to auto-connect pe- 
ripherals where possible and allow users 
to configure all other parameters and 
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connections through a simple GUI. 

Having designed the processor, 
users then generate and export to the 
FPGA layout tool. The processor may 
only be part of the FPGA design. The 
design may include additional glue log- 
ic or interfaces that aren't controlled 
by the processor. Using a simple block 
design tool, users can drop in the 
processor and any additional blocks 
to create a single top-level design for 
synthesis. 

The design then follows a standard 
FPGA flow: synthesis, place and route, 
timing analysis, and so forth. Ideally, 
the Cortex-Ml processor should be de- 
livered with a custom layout from the 
silicon vendor, iridicating that timing 
analysis and optimization have been 
done. This ensures maximum perfor- 
mance in a mitumimi footprint. The 
layout should be customized to the 
device you plan to use. From here, the 
custom processor can be programmed 
to the PSC. 

BOARD OPTIONS 

The choice of development environ- 
ment is important. Buying a "demo 
kit" usually means you can get the de- 
sign or board to do what the supplier 
intended. What you really want is to 
convince yourself that both the tech- 
nology and your design have the ability 
and functional accuracy to eliminate 
the need to build custom prototypes 
- or at least skip the first round of pro- 
totyping. 

Look for a board that has the inter- 
faces built-in, and has enough I/O avail- 
able with connectors to work with your 
regular platform or peripheral board. 

In addition, key components, such 
as crystals that can switch out, increase 
the board's usability as a true prototyp- 
ing environment. You should also be 
able to program and debug on the de- 
velopment platform. 

Software implementation is similar 
to a standalone processor. The tool 
that builds the processor should ex- 
port the appropriate files required by 
the compiler to create the application. 
Existing functions can be exported 
and enable the ability to work from 



a library of functions to control the 
peripherals. Custom code can then be 
added on top of this. 

For many processors, including 
Cortex-Ml, the GNU environment 
offers free tools including compilers 
and debuggers. However, extra money 
does get you more mileage. 

Generally, the more expensive 
compilers will provide either better 
performance or less code space. Still, 
free compUers will get you a func- 
tionally accurate design and a long 
way through prototyping. If you're 
working with a lot of library func- 
tions, the functions should already be 
optimized, eliminating the need for 
e!q)ensive compilers. 

Debug is the next phase of design. 
With a soft processor, the ability to 
debug both the hardware and software 
is important. Again, a pre-optimized 
version of the processor helps, as pro- 
cessor debi% W0ii*t be necessary. 

Some tools are designed to coex- 
ist, such as ModelSim, which runs the 



hardware, and GNU, which runs the 
processor application. It's best to get 
these tools from your silicon supplier, 
because the PSC is unique and other 
debug tools won't be set up to handle 
the architecture. Also, with a standard 
ARM processor, existing ARM debug 
tools support debug of the processor 
application, but not any additional 
FPGA logic. 

All of the methods and strategies 
described apply to intelligent power 
control, whether the application is a 
simple handheld portable device or 
for a multiserver rack system. The 
desire to conserve energy and cost will 
continue to drive electronics in this 
directioa, vMie also driving designers 
to meet these challenges for years to 
come. 

Wendy Lockhart is a principal engineer 
at Actel. She holds a master's degree 
in electronics from the University of 

Edinburgh, Scotland. Lockhart can be 
reached at wendy.lockhart@actel.com. 
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The system virtual machine can be used to make 

Linux-based applications faster and more responsive 
and secure. Here's a primer to get you started. 



Virtualizing embedded 



BY CERNOT HEISER 



Luuix 



Lintix is rapidly becoming the operating system of 
choice in a wide array of embedded appHcations, 
ranging from mobile handsets and network/tele- 
com infrastructure applications to media-rich 
consumer electronics de- 
vices such as portable media players and 
digital video systems. 

Many embedded systems developers 
are already using Linux in their designs 
or are considering doing so. The percep- 
tion among developers is that it's easier to 
develop applications for Linux than it is to 
develop for proprietary operating systems 
and that using Linux reduces costs because 
it's open source. 

However, Linux still presents a number 
of problems in the embedded space. 

First, the nature of the programming 
environment is often bifurcated. For ex- 
ample, in many media-rich consumer ap- 
plications, Linux is used to run high-level 
application code that is similar - and often 
identical - to application code used on 
personal computers. Such code is typically 
developed by application programmers 
who normally are not experts in program- 
ming low-level embedded systems. 

But such appHcations have much real-time functionality that 
requires low and predictable interrupt latency. In the case of the 
mobile phone terminal, the cellular communication subsystem 
has real-time requirements. And while embedded Linux has cer- 
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tainly improved, these requirements 
are best met by a small and highly 
efficient real-time operating system 
(RTOS). 

Second is the problem of security. 
In a mobile-phone handset, for ex- 
ample, the communication stack is of 
critieal importance - if it is subverted 
by an attacker, the phone could be 
turned into a jammer that disables 
communication in the whole cell. Simi- 
larly, an encryption subsystem needs 
to be strongly protected from being 
compromised. 

It's no insignificant challenge to 
create a secure system that runs mil- 
lions of lines of code; inevitably, the 
code contain tens of thousands of bugs, 
many of which can compromise the 
system's security. Increasingly prone to 
attacks, embedded Linux implementa- 
tions are large enough (hundreds of 
thousands of lines of code) to contain 
as many as a thousand bugs. Because 
the Linux operating system normally 
nms in privileged mode, once it is com- 
promised, attacks on any part of the 
system are possible. 

Third is the issue of license separa- 
tion. Linux is a frequently deployed 
high-level operating system. Among its 
advantages are the royalty-free status, 
independence from specific vendors, 
widespread deployment, and a strong 
and vibrant developer community. 

A frequent concern about Linux 
is that it's distributed under the GPL 
license, which requires that all derived 
code is subject to the same license and 
thus becomes open source. Some legal 
arguments claim that the license ap- 
plies even to device drivers that are 
loaded into the kernel as binaries at 
run time. 

This restriction creates a potential 
problem for chipmakers who consider 
device interfaces valuable proprietary 
IP. An open-source device driver will 
effectively publish those device inter- 
faces, a strong disincentive for using 
Linux in many embedded systems 
scenarios. 

As is the trend with desktop ap- 
plications developers, many embedded 
systems developers are looking to the 
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use of system virtualization environ- 
ments, also called system virtual ma- 
chines, to resolve, or at least minimize, 
such problems. 

Unlike process virtual machine 
environments specific to particular 
programming languages, such as the 
Java VM, system virtual machines cor- 
respond to actual hardware and can 
execute complete operating systems in 
isolation from other similar instantia- 
tions in the same computing environ- 
ment. 

This article will explain embedded- 
system virtual machine models and ex- 
plores where and how they can be used 
to make Linux-based applications faster 
and more responsive and secure. 

THE BASICS OF VIRTUALIZATION 

Virtualization refers to providing a 
software environment in which pro- 
grams (including operating systems) 
can run as if on bare hardware, as Fig- 
ure 1 shows. Such an environment is 
called a virtual machine. A virtual ma- 
chine is an efficient, isolated duplicate 
of the real machine. 

The software layer that provides 
the virtual machine environment is 
called the virtual machine monitor 
(VMM), or hypervisor. The VMM has 
three essential characteristics: 

I. It provides an en\'ironment for 
programs that is essentially identi- 



cal to the original machine; 

2. Programs that run in this environ- 
ment show, at worst, minor de- 
creases in speed; and 

3. The VMM is in complete control 
of system resources. 

All three characteristics are im- 
portant and contribute to making 
virtualization highly useful. The first 
(similarity) ensures that software that 
runs on the real machine will run on 
the virtual machine. The second (ef- 
ficiency) ensures that virtualization is 
practical from the performance point 
of view. 

The efficiency feature requires that 
the vast majority of instructions be 
directly executed by the hardware: any 
form of emulation or interpretation 
replaces a single virtual-machine in- 
struction by several instrucdons of the 
underlying hardware. 

This requires that the virtual hard- 
ware be almost identical to the physical 
hardware on which the VMM is hosted. 
Small differences are possible, such as 
the physical hardware may miss some 
instructions of the virtual hardware (as 
long as they aren't heavily used), the 
memory-management unit may be dif- 
ferent, or devices may differ. 

However, not all instructions can 
be direcdy executed. The resource-con- 
trol feature requires that all instructions 
that deal with resources must access 
the virtual rather than the physical re- 
sources. This means such instructions 
must be interpreted by the VMM, as 
otherwise virtualization is violated. 

Specifically, the virtual machine 
must interpret two classes of instruc- 
tions: 

1 . control-sensitive instructions 

modify the privileged machine 
state and therefore interfere with 
the hypervisor's control over re- 
sources; and 

2. behavior-sensitive instructions 

access (read) the privileged 
machine state. While these in- 
structions can't change resource 
allocations, they reveal the state 
of real resources, specifically 
when they differ from the virtual 
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resources and therefore break the 
illusion provided by virtualiza- 
tion. 

BENEFITS OF VIRTUALIZATION 
The key attraction of virtualization for 
embedded systems developers is that it 
supports the concurrent existence and 
operation of multiple operating systems 
on the same hardware platform. 

Virtualization helps overcome the 
challenges caused by the biftircated 
programming environments by run- 
ning appropriate operating systems 
concurrently on the same processor 
core, as shown in Figure 2. The same 
effect could be achieved by using sepa- 
rate cores for the real-time and applica- 
tion software stacks, combined with 
hardware mechanisms for partitioning 
memory. 

The ability to run several concur- 
rent operating systems on a single 
processor core may reduce the bill of 
materials, especially for lower-end de- 
vices. It also provides a uniform operat- 
ing system environment in the case of 
a product line (composed of high-end 
devices using multiple cores as well as 
lower-end single-core devices). 

Virtualization can also be used 
to enhance security. A system virtual 
machine encapsulates a subsystem, so 
that its failure can't interfere with other 
subsystems (note that this encapsula- 
tion is courtesy of the aforementioned 
resource-control requirement). 

TYPES OF VIRTUALIZATION 

There are two basic ways to ensure 
that code running in the virtual ma- 
chine doesn't execute any sensitive 
instructions: I) pure virtualization, 
which depends on sensitive instruc- 
tions not being executable by the vir- 
tual machine and 2) para-virtualiza- 
tion, where sensitive instructions are 
removed from the virtual machine. 

Pure virtualization 

The classical approach is pure vir- 
tualization, which requires that aU 
sensitive instructions be privUeged. 
Privileged instructions can be executed 
in a privileged state of the proces- Figure 3 
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sor (typically called privileged mode, 
kernel mode, or supervisor mode) but 
generate an exception when executed 
in unprivileged mode (also called user 
mode). This is shown in Figure 3. An 
exception enters privileged mode at a 
specific address (the exception han- 
dler), which is part of the hypervisor. 
The hypervisor can then interpret 
("virtualize") the instruction as re- 
quired to maintain virtual machine 
state. 

Until recently, pure virtualization 
was impossible on almost all contem- 
porary architectures, as they all fea- 
tured sensitive instructions that were 
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not privileged. Recently many of the 
major processor manufacturers - in- 
cluding Intel and AMD in the desktop 
space and ARM in the embedded 
market - have added virtualization ex- 
tensions that allow the processor to be 
configured in a way that forces all sen- 
sitive instructions to cause exceptions. 

Despite this, there are a number 
of reasons this approach to virtualiza- 
tion is not generally used, especially 
in embedded applications. One is that 
exceptions are expensive. 

On pipelined processors, an excep- 
tion drains the pipeline, resulting in 
delay in processing, typically one cycle 
per pipeline stage. A similar delay 
typically happens when returning to 
user mode. Furthermore, exceptions 
(and exception returns) are branches 
that usually are not predictable by a 
processor's branch-prediction unit, 
resulting in additional latency. 

These effects typically add up to 10 
to 20 cycles, more in deeply pipelined 
high-performance processors. Add to 
this the work required for the actual 
instruction emulation, we can see that 
virtualizing a single instruction costs 
dozens of cycles. Some processors 
(notably the x86 family) have excep- 
tion costs that are much higher than 
this (hundreds of cycles). This creates 
substantial overhead for operating 
system's code, which frequently ex- 
ecutes many privileged instructions in 
a short time. 

Para-virtuaUzation 
In this approach, the source code can 
be manually modified to remove direct 
access to privileged state and instead 
replace such accesses by explicit in- 
vocations of the hypervisor ("hyper- 
calls"). 

Para-virtualization allows replace- 
ment of many sensitive instructions 
by a single hypercail, thus reducing 
the number of (expensive) switches 
between unprivileged and privileged 
mode. 

If properly implemented, para-vir- 
tualization has the potential to reduce 

the virtualization overhead. Variants 
of para-virtualization have been de- 



SiTuchjre of monolithic and microl<ernel-based systems. 





Application 


•••• syscall 


* 



IPC, file system 

Scheduler, virtual memory 

Device drivers, dispatctier, ... 



Hardware 



User 
mode 



Kernel 



Application 


1 




File 







IPC, virtual memory 



Hardware 



Figure 4 

ployed for years by VMWare and Xen 
from the University of Cambridge, 
both aimed at the enterprise market. 

Recently virtualization solutions 
aiming at embedded designs have 
emerged, such as L4/Wombat from the 
University of New South Wales, and 
the commercial systems Trango, VLX 
from VirtualLogix as well as OK4 from 
Open Kernel Labs. 

All have their advantages and dis- 
advantages in different environments, 
but what most of these approaches all 
have in common is that they intro- 
duce another layer of software - and 
complexity - into the operating envi- 
ronment. And in many embedded ap- 
plications, where code size frequently 
measures in the millions of lines of 
code, breaking this into two or three 
virtual machines is of limited help for 
improving overall system reliabiUty 
and security. 



The isolation provided by current 
approaches to para-virtualization is by 
its nature coarse-grained - it provides 
the appearance of a complete machine 
for each subsystem. This means that 
each virtual machine is required to 
run its own operating system, making 
them relatively heavyweight. 

Consequently, increasing the num- 
ber of virtual machines in order to re- 
duce the granularity of the subsystems 
would create serious performance 
issues and significantly increase the 
amount of code. TTiis in turn not only 
requires increased memory size and 
thus power consumption, but also re- 
sults in more points of failure. 

USING A MICROKERNEL 
ASAHYPERVTSOR 

The most appropriate way to deploy 
para-virtuaUzation in an embedded 
design is by integrating it into the 



structure of the operating system. 
But as Linus Torvalds commented 

recently, integrating it into Linux is 
not practical because there is no one- 
size-fits-all "One True VirtuaUzation" 
model appropriate to all the applica- 
tions in which Linux is being used. It 
also wouldn't help for those deploying 
devices without Linux (because they 
want to oifer a choice of high-level 
operating systems, or maybe because 
they ship low-end devices in which a 
high-level operating systems makes no 
sense). 

A better place to implement an 
integrated para-virtualization mecha- 
nism is in the companion RTOS used 
to handle the hard real-time opera- 
tions that Linux can't. This approach 
requires more than a bare hypervisor: 
it needs a kernel that can provide basic 
operating-system mechanisms, such 
as a high-performance message-based 
microkernel, as Figure 4 shows. 

Unlike the classical monolithic 
operating system structure of Linux 
with a vertical structure of layers, each 
abstracting the layers below, a micro- 
kernel-based system, exhibits a hori- 
zontal structure. System components 
run beside application code and are 
invoked by sending messages. 

A notable property of a microker- 
nel system is that as far as the kernel is 
concerned, there is no real difference 
between "system services" and "ap- 
plications" - all are simply processes 
running in user mode. Each such user- 
level process is encapsulated in its own 
hardware address space, set up by the 
kernel. 
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A subsystem running on a micro- 
kernel can only affect other parts of 
the system (outside its own address 
spaces) by invoking kernel mecha- 
nisms, particularly IPC. It can only 
directly access memory (or another re- 
source) if it's mapped into its address 
space via a system caU. 

This model is a good fit for em- 
bedded systems, where the distinction 
between "system services" and "ap- 
plications" is frequently meaningless 
due to the cooperative nature of the 
interaction of subsystems. 

The central mechanism provided 
by a microkernel is a message-pass- 
ing communication mechanism called 
IPC. 

In the horizontal system structure, 
IPC is used for invoking all system ser- 
vices, as well as providing other com- 
munication between subsystems. Ow- 
ing to the need for high-bandwidth, 
low-latency communication, a micro- 
kernel typically also provides mecha- 
nisms for setting up shared memory 
re^ons between processes. 

RIGHT MECHANISM 

In this context, a microkernel provides 

the right mechanisms for efficiently 
supporting virtualization. The micro- 
kernel serves as the hypervisor, which 
catches virtualization traps. Contrary 
to other virtualization approaches, the 
microkernel forwards the exception to 
a user-mode virtual machine monitor, 
which either performs the emulation or 
signals a fault. 

The IPC is also the enabler for 
low-overhead virtualization: a sys- 
tem-call trap executed by a guest ap- 
plication in a virtual machine invokes 
the microkernel's exception handler, 
which converts this event into an IPC 
message to the guest operating system. 

The guest handles it as a normal 
system call. The system-call result is 
returned to the guest application via 
another IPC message, which unblocks 
the waiting guest process. 

Similarly, IPC is used to deliver 
interrupts to the guest operating sys- 
tem's interrupt handler, h's also used 
to communicate with device drivers. 
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Table 1 

and for communication and synchro- 
nization between any components of 
the system, including between virtual- 
machine environments. 

As the same IPC mechanism is 
used for many different operations, it's 
typically highly optimized. This im- 
plicitly benefits virtualization as weU 
as other critical system operations. As 
a well-designed IPC mechanism is also 
very simple, it's po^Me to optimize it 
in virtually all of its aspects. 

A WlACnCAL AFPLIGATION 

To see how this more fine-grained ap- 
proach to virtualization has an impact 
on how an embedded systems designer 
works, let's look at a media player 
design, originally hosted on a more 
traditional para- virtualization environ- 
ment with Linux as the guest operating 
system. 

The design is then ported to run in 
its own address space as a native appli- 
cation under the open source, but com- 
mercially supported, message-based 
OKL4 virtualization-ready microkernel. 

The media player can then run 
side-by-side with the Linux system 
(that still supports other applications) 
but also with a trusted crypto service 
that runs in a minimal trusted-com- 
puting-base environment. Over time, 
more components can be extracted 
from their monolithic environments, 
be it a high-level operating system or 



an RTOS running a communications 
stack, into their own protected com- 
partments. 

This includes device drivers, net- 
work stacks, file systems and other 
functional components. Such an ap- 
proach can dramatically improve the 
robustness of the system by introducing 
internal protection boundaries that 
contain the damage caused by bi^. 

For more than 10 years, L4 has 
been successfully used as a hypervi- 
sor for virtualizing Linux. As shown 
in Table 1, the performance of open- 
source OKL4-based virtual machines 
is typically within less than 5% of the 
native performance. A particularly in- 
teresting result is that of Linux on ARM 
platforms, where OK Linux (Linux vir- 
tuahzed on OKL4) outperforms native 
Linux in Imbench context-switching 
and other microbenchmarks by factors 
of up to 50. 

Gemot Heiser (gernot@ok-labs.com) is 
cofounder and chief technology officer of 
Open Kernel Lobs (OK). Prior fo founding 
OK, he created and led the Embedded, 
Real-Time and Operating Systems (ER- 
TOS) research program at NICTA, the 
Australian national centre of excellence 
for information and communications 
technology. He is also a professor at the 
University of New South Wales. Gemot 
Heiser holds a PhD in computer science 
from ETH Zurich, Switzerland. 
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Back to the future: 

Manchester 

when commercial options fail, try using Manchester 
encoding and other time-tested protocols in low-cost, 
low bit-rate serial communications. 
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When developing an embedded system that 
reqidto ba^ serial network communica- 
tions, the list of design options are plentifiil. 
But when all of 
«gttll-Ae^ih*lf 
alternatives fail to meet your requirements, 
it's worth remembering that traditional and 
well-tested protocols, such as Manchester 
encoding, offer some unique alternatives to 
the current commercial methods of wired 
and wireless control-data transaQissi0ii. 

The off-the-shelf choices are numerous. 
Many of today's microcontrollers offer such 
a wide variety of network peripherals that 
the possibility of not finding the right one 
for your application seems highly remote. 
Whether your system architecture requires a 
peer-to-peer or multi-drop topology, stan- 
dards such as RS-232 and RS-485, to name 
a few, have loyally served the embedded 
development community for a long time. In 
addition, today's technology has expanded 
to include CAN bus and Ethernet as possible 
embedded network solutions. 

For applications that are cost-sensitive, 
have a low data rate, and are physically constrained, some component 
manufacturers offer a variety of low-cost devices that interface to a 
simple network scheme, Dallas Semiconductor has taken it one step 
fcF&er and desipied a family of comptiwits that communicites w 
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a proprietary one-wire network. This 

novel design provides power and com- 
munications all on a single wire. These 
devices work well for low-cost designs 
where the power required from the 
single wire is not significant. 

In wireless network applications, 
nuinerous solutions based on the use 
of the ZigBee protocol atop the IEEE 
802.15.4 standard for physical layer 
and medium access control in low-rate 
wireless personal area networks are 
available off-the-shelf from companies 
as diverse as Amber, Freescale, Micro- 
chip, and Texas Instruments, among 
others. 

In addition, Microchip has an even 
lower bit-rate solution, combining its 
more sparse proprietary MiWi pro- 
tocol atop the 802.15.4 physical layer. 
Freescale, in addition to its fiill ZigBee 
products, offers developers alternatives 
that use lower bit-rate protocols atop the 
802.15.4 physical layer. 

WHEN OFF-THE-SHELF 
IS NOT ENOUGH 

But even with so many options, both 
wired and wireless, available off the 
shelf, there are a significant number of 
networked control designs where cost 
and physical size continue to place a 
heavy burden on our designs. In those 
cases, the usual list of suspects may not 
be viable after all. 

Embedded developers are then 
challenged to dig deep and find solu- 
tions that satisfy their performance re- 
quirements, while maintaining a small 
physical size and a lower product cost, 
all without sacrificing data integrity. 

For those systems requiring more 
power than is capable from one-wire 
solutions, standard off-the-shelf com- 
ponents may not be feasible. The solu- 
tion may require a custom design that 
strives to balance functional require- 
ments with a number of significant 
constraints. 

When cost constraints are tight and 
performance requirements less stringent, 
it might be worth your while to use a 
"back to the fiiture" strategy and con- 
sider die use of a well-Jcnown and well- 
tested serial communications protocols 
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Figuris 1 

such as Non-Return to Zero (NRZ) or 
Manchester encoding as a possible solu- 
tion to your embedded design. 

All Ji^iat ^ial communications 

share one thing in common; they all 
send a series of ones and zeros. From 
thisi point, the dififeriatiating factor 
becomes the level of the signal, the or- 
ganization of the data, how the data is 
encoded, and how the data is synchro- 
nized. In a design that is sensitive to 
component availability and overall cost, 
it's highly desirable that serial data is 
transferred with a minimum number of 
wires and is built from readily available 
components. 

NON-RETURN TO ZERO (NRZ) 

One of the simplest ways to transmit 
digital data is by having a separate 
clock and data line. In this approach, 
a clock signal of constant frequency is 
synchronized with its corresponding 
data. Depending upon the preference of 
the designer, the data is either latched 
on the rising or &lling edge of the 
clock. Figure 1 illustrates a typical NRZ 
implementation. 

In this example the binary string 
"101001 1" is being transmitted. Each 



1_ 
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bit is identified on the rising edge of the 

clock signal. As simple as this process 
is, you need to consider at least three 
fundamental drawbacks: 

1 . An additional line must support 
the clock signal. This is required to 
latch the data accurately. Depend- 
ing upon the quality and length of 
the transmission line, additional 
circuitry may be required to pro- 
vide the proper drive capability. 

2. The second problem occurs if you 
decide to eliminate the clock signal. 
In this scenario the receiver will 
require an internal clock that is in 
near-perfect synchronization with 
the transmitter. Any phase shift 
between the transmitter and the 
receiver can cause bit errors to oc- 
cur. This may seem a trivial matter. 
However, when the clock frequency 
becomes high enough, the sensitiv- 
ity to phase differences between the 
transmitter and receiver becomes 
more significant. 

3 . A third limitation occurs when an 
NRZ transmission contains a long 
string of ones or zeros, as shown in 
Figure 2. This can make the syn- 



A limil-3hon occurs when an NRZ fransmission conl-ains a long 
string of one's or zeros. 





No discemoble bit boundaries making it difficult 
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Two diffefenl- apHons for encoding fhe dafa. 

Option A 

OR 



Option B 
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chronization between tiie transmit- 
ter and receiver even more sensitive 

to bit encoding errors. Since the line 
is in one state for a relatively long 
period of time, there are no transi- 
tions. Without transitions on the 
data line, it becomes impossible to 
see where a bit boimdary is located. 
This can result in erroneous data 
encoding. 

The next section describes an en- 
coding scheme that adckesses these 
issues and provides a unique alterna- 
tive to the traditional method of data 
transmission. 

MANCHESTER ENCODING 
Manchester encoding offers distinct 
advantages over other digital encod- 
ing schemes. It has become a popular 
Standard for low-cost radio frequency 
communication of digital data. Even 
Ethernet employs Manchester encod- 
ing that was used to deliver this article 
to your computer (if you're reading 
this online). So what exactly is Man- 
chester encoding and how can it be 
used effectively in a low-cost embed- 
ded systems design? 

Manchester encoding was first 
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developed back in the late 1940s at the 
University of Manchester in Manches- 
ter, England. Given the time period 
and location, one with a prochvity for 
history might be inclined to believe its 
development was perhaps a by-product 
of research done by an obscure World 
War II code-breaker at England's infa- 
mous Bletchley Park. 

In reality, Manchester encoding 
was the resuit of is^earch done at the 
University of Manchester into phase 
modulation techniques used for read- 
ing and writing digital data onto a mag- 
netic storage device. Since that time, 
Manchester encoding has gained wide 
acceptance as the modulation scheme 
for low-cost radio-frequency transmis- 
sion of digital data. 

One of the most dgm&mut charac- 
teristics of Manchester encoding is its 
unique way of representing digital data. 
Rafh€:r tiian representing data based on 
a particular level, Manchester encoding 
uses transitions (see Figure 3) to iden- 
tify a binary one or zero. In more tra- 
ditional encoding schemes, a separate 
clock signal determines when to sample 
the data line. Manchester encoding uses 
one signal to identify the data. 

Figure 3 shows two different op- 



tions tor encomng tne data, mere nas 
been some debate over which scheme 
is more advantageous. The Ethernet 
and IEEE standards describe Option B 
as the method for encoding data, while 
the original Manchester-encoded speci- 
fication described Option A. Confusion 
may occur when looking at the \a\x- 
age signals on a Manchester-encoded 
line. Most line drivers will invert their 
signals, which have led some to believe 
that Option B is being implemented. 
In the end, when developing your own 
custom-designed embedded system, the 
choice as to which option is best may 
be purely academic. However, once 
a choice has been made, consistent 
implementation is a necessity. 

With this information, let's con- 
struct a Manchester-encoded bit stream 
using Option A. We will use the binary 
bit pattern shown in Figure 1. The first 
thing to do is establish what are called 
bit boundaries. These are the points in 
time where a level transition occurs. Bit 
boundaries are analogous to one clock 
period in an NRZ scheme. It's these bit 
boimdaries that define the points where 
Manchester encoding of the individual 
bits will occur. 

Figure 4 illustrates the bit str^un 
constructed using Manchester encod- 
ing. Each point where you see an arrow 
is defined as the bit boundary. The ar- 
row indicates the direction of the tran- 
sition. So, using Option A from Figure 
3, the first binary "1" bit is translated by 
a transition from a high to a low level. 
One clock period later, another transi- 
tion occurs. This time a binary "0" bit 
has been encoded by a low- to hi^- 
level transition. 

Further down the bit stream, you 
will notice a transition that looks out 
of place. It occurs at a point halfway 
between two clock periods. This is 
called the setup point. The reason for 
the setup point is to ensure the signal is 
at the correct level prior to the next bit 
boundary. 

CONSTRUCTION OF MANCMESTER- 

ENCODEDDATA 

Manchester encoding is very easy to 

construct. You simply combine the se- 



The bit- stream consl^rucfed using Manchester encoiing. 
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To creal-e Manchester encoding, simply combine ^e serial bits to be encoded witti ttie clock running 
at ttie bit-boundary rate. Ttie Manchester-encoded output below has the same waveform as with the 
bit stream in Figured . 



Bit stream 
Clock 




Manchester- encoded output 



Clock 



Bit stream 



J 



Manchester- 
encoded output 



Figure 5 

rial bits to be encoded with the clock 
running at the bit-boundary rate (see 
Figure 5). When you compare the 
Manchester-encoded output in i-igure 
5 with the bit stream in Figure 4, you'll 
see the same waveform. 

DECODING MANCHESTER- 
ENCODED DATA 
Decoding Manchester-encoded data 
is as easy as encoding it. You simply 
perform an exdusive-OR of the Man- 
chester-encoded signal with a logical 



"1" at the bit-boundary sample points, 
as shown in Figure 6. 

If you prefer an analog solution 
for decoding. Figure 7 shows a simple 
circuit that can achieve the same results. 
The beauty of this circuit, sometimes 
called a data-slicer, is that it doesn't re- 
quire a synchronizing clock. This elimi- 
nates the possibly of errors caused by 
clock jitters or mismatches between the 
transmitting and receiving dock signals. 
The only issue is the valttes selected for 
the resistor and caiadtor. They must be 



selected so that the RC-time constant is 
longer than 'A the bit-boundary period. 
This will prevent detection during a 
setup period. While Figure 7 represents 
a conceptual view, in practice you would 
want to create some positive feedback 
around the amplifier. The positive feed- 
back will provide hysteresis that will help 
filter out noise that could possibly be 
misinterpreted as a real Manchester-en- 
coded transition. 

Once the hysteresis and the RC- 
time constant have been properly 



To decode Manchester-encoded data, perform anexclusive-OR of the Manchester-encoded 
signal with a logical "1 " at the bit-baundary sample points. 
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This analog soluHon for decoding shows a simple circuif friaf can 
achieve fhe same resulfs as Figure 6. 
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setup, the circuit will reliably decode 
Manchester-encoded signals. You will 
also notice that this circuit will work for 
both Option A and Option B. 

CLOCK SYNCHRONIZATION 

Another intrinsic value to Manchester 
encoding is the fact that the synchroniz- 
ing clock is embedded within the signal. 
This fact is exploited in Ethernet, which 
uses on-board circuitry to maintain 
clock synchronization. A Digital Phase 
Locked Loop (DPLL) circuit monitors 
the incoming Manchester-encoded 
signal and makes adjustments to its 
internal oscillator to keep it in constant 
synchronization with the transmitter's 
clock frequency. 

The DPLL functions by sampling 
the incoming Manchester-encoded data 
with its own local clock. A simple shift 
register, driven by the local clock, accu- 
mulates all the shifted bits. 

If the local oscillator is in synchro- 
nization with the transmitter's clock, 
there will be an equal number of binary 
I's and O's across the shift register. If an 
imbalance occurs between binary I's 
and O's, the local clock is adjusted based 
on the number of binary bits off center. 
This is why you wiU find a preamble at 
the beginning of each packet transmit- 
ted via Ethernet. 

Each Ethernet packet starts vfith an 
8-byte (64 bit) preamble, which is used 
by the DPLL to "lock" into the correct 
fi'equency. Since the preamble doesn't 
contain useful data, no data is lost. 
However, it does add more overhead to 
the data stream. 

A more esoteric version of Man- 
chester encoding is a scheme called Dif- 



ferential Manchester encoding (DME). 
Think of it as Manchester encoding on 
steroids. DME is a more efficient en- 
coding scheme because it requires less 
bandwidth than standard Manchester 
encoding. 

The overhead of transmitting a 
data stream using DME is less because 
it doesn't require a preamble, which is 
used by the DPLL to lock onto the clock 
frequency. Because of this, DME can be 
found in networks, such as fast Ethernet 
over copper twisted-pair wiring. 

DME differs from standard Man- 
chester encoding in one simple way: 
Manchester encoding represents binary 
data based on a positive or negative 
edge transition at each bit boundary. 
DME represents data by the presence or 
absence of a transition between two bit 
boundaries. Simply stated, if a transi- 
tion occurs between a bit boundary, it's 
represented as a binary 0. An absence 
of a transition signifies a binary 1. 

As a complement to this reintro- 
duction to the basics of Manchester 
encoding for low-bit serial network ap- 
plications, a second article is available 
online at Embedded.com. The article 
will leverage from the theory presented 
here and offer a practical, real-world 
example that illustrates the simplicity 
of implementing Manchester encoding 
into a real embedded design. 

Robert Guastelia (robert.guastella@tennanJ- 
co) is a senior controls engineer for Ten- 

nant Company in Minneapolis, Minnesota. 
He has over 22 years of experience in 
hardware and software design on products 

ranging from industrial controls, to digital 
servo drives, to automotive electronics. 
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Multicore processors are here to stay but memory is a bottleneck. 




BY JACK C. DANS5LE 



^■^^^^^ or many years, processors and memory evolved more or less in 
HH^^^I lockstep. Early CPUs like the Z80 required a number of machine 
H cycles to execute even a NOP instruction. At the few-megahertz 

H clock rates then common, processor speeds nicely matched 

H EPROM and SRAM cycle times. But for a time, memory speeds 

increased faster than CPU clock rates. The 8088/6 had a prefetcher to better bal- 
ance fast memory to a slow processor. A very small (4 to 6 bytes) FIFO isolated 
the core from a bus interface unit (BIU). The BIU was free to prefetch the most- 
likely-needed next instruction if the core was busy doing something that didn't 



need bus activity. The BIU thus helped maintain 
a reasonable match between CPU and memory 
speeds. 

Even by the late 1980s, processors were pretty 
well matched to memory. The 386, which (wMiitfie 
exception of floating-point instructions) has a pro- 
grammer's model very much like Intel's latest high- 
end offerings, came out at 16 MHz. The three-cycle 
NOP instruction thus consumed 188 nsec, which 
partnered well with most zero wait-state memory 
devices. 

But clock rates continued to increase while mem- 
ory speeds started to stagnate. The 386 went to 40 
MHz, and the 486 to over 100. Some of the philoso- 
phies of the reduced instruction set (RISC) move- 




ment, particularly single-clock instruction execution, 

were adopted by CISC vendors, further exacerbating the mismatch. 

Vendors turned to Moore's Law as it became easier to add lots of transistors to 
processors to tame the memory bottleneck. Pipelines sucked more instructions on- 
chip, and extra logic executed parts of many instructions in parallel. 

A single-clock 100 MHz processor consumes a word from memory every 10 
nsec, but even today that's pretty speedy for RAM and impossible for flash. So on- 
chip iat&ite ^^cted, again exploiting cheap integrated transistors. That, plus float- 
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multicore offerings from Intel, AMD, 
and some others are SMP devices. 

SMP will yield performance im- 
provements only (at best) insofar as 
a problem can be parallelized. Santa's 
work cannot be parallelized (unless he 
gives each elf a sleigh), but delivering 
mail-order products keeps a fleet of UPS 
trucks busy and efKcient. 

Amdahl's Law gives a sense of the 
benefit accrued from using multiple 
processors. In one form, it gives the 
maximum speedup as: 

1 

/+(!-/)/« 



meant the 486's transistor budget was 
over four tunes as large as the 386. 

Pentium-class processors took 
spmls to unparalleled extremes, before 
long hitting two and three gigahertz. 
Memory devices at 0.33 nsec are im- 
practical for a variety of reasons, not the 
least of which is the intractable problem 
of propagating those signals between 
chip packiges. Few users would be con- 
tent with a 3-GHz processor stalled by 
issuing 50 wait states for each memory 
read or write, so cache sizes increased 
more. 

But even on-chip, zero wait-state 

memor)' is expensive. Caches multi- 
plied, with a small, fast LI backed up by 
a slower 12 and in some cases even an 
L3. Yet more transistors implemented 
immensely complicated speculative 
branching algorithms, cache snooping 
and more, all in the interest of manag- 
ing the cache and reducing inherently 
slow bus traffic. 

And that's the situation today. 
Memory is much slower than processors 
and has been an essential bottleneck for 
fifteen years. Recently CPU speeds have 
stalled as well, limited now by power 
dissipation problems. As transistors 
switch, small inefficiencies convert a 
tiny bit of V„ to heat. And even an idle 
transistor leaks microscopic amounts 
of cmrent. Small losses multipKed by 
hundreds of millions of devices means 
very hot parts. 

Ironically, vast numbers of the 
transistors on a modern processor do 
nothing most of the time. No more 
than a single line of the cache is active 
at any time, most of the logic to handle 
hundreds of different instructions 
stands idle till infrequentiy needed, and 
page translation units that manage gi- 
gabytes handle a single word at a time. 

But those idle transistors do con- 
vert the power supply to waste heat. 
The "transistors are free" mantra is 
now stymied by power concerns. So 
limited memory speeds helped spawn 
hugely complex CPUs, but the resultant 
heat has curbed clock rates, formerly 
the biggest factor that gave us faster 
computers every year. 



similar dynamics were at work. Gal- 
Uum arsenide logic and other exotic 
components drove clock rates high, 
and liquid cooling kept machines from 
burning up. But long ago, researchers 
recognized the futility of making much 
additional progress by spinning the 
clock-rate wheel ever higher and started 
building vastly parallel machines. Most 
today employ thousands of identical 
processing nodes, often based on pro- 
cessors used in standard desktop com- 
puters. Amazing performance comes 
from massively parallelizing both the 
problems and the hardware. 

To continue performance gains, 
desktop CPU vendors co-opted the su- 
percomputer model and today offer a 
number of astonishing multicore devic- 
es, which are just two or more standard 
processors assembled on a single die. 
A typical configuration has two CPUs, 
each with their ovm LI cache. Both share 
a single 12, which connects to the out- 
side world via a single bus. Embedded 
versions of these parts are available as 
well and share much with their desktop 
cousins. 

Symmetric multiprocessing (SMP) 
has been defined in a number of differ- 
ent ways. I chose to call a design using 
multiple identical processors that share 



where /is the part of the computa- 
tion that can't be parallelized, and n 
is the number of processors. With an 
infinite number of cores, assuming no 
other mitigating circumstances. Figure 
1 shows (on the vertical axis) the pos- 
sible speedup versus (on the horizontal 
axis) the percentage of the problem that 
can't be parallelized. 

The Law is hardly engraved in stone 
as there are classes of problems called 
"embarrassingly parallel" where huge 
numbers of calculations can take place 
simultaneously. Supercomputers have 
long found their niche in this domain, 
which includes problems like weather 



Possible speedup versus percen^age of fhe problem l"haf 
cannot be parallelized, assuming an infinil'e number of cores. 
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prediction, nuclear simulations, and 
the like. 

The crucial question becomes: How 

much can your embedded application 
benefit from parallelization? Many 
problems have at least some amount of 
work that can take place simultaneous- 
ly. But most problems have substantial 
interactions between components that 
must take place in a sequence. It's hard 
at best to decide at the outset, when one 
is selecting the processor, how much 
benefit we'll get from going multicore. 

Marketing literatiure firom multicore 
vendors suggest that a two-core system 
can increase system performance from 
30 to 50% (for desktop apps; how that 
scales to embedded systems is another 
question entirely, one that completely 
depends on the application). Assuming 
the best case (50%) and working Am- 
dahl's Law backwards, one sees that the 
vendors assume a third of the PC pro- 
grams can be parallelized. That's actually 
a best, best case, as a PC runs many dif- 
ferent bits of software at the same time 
and could simply split execution paths 
by application. But, pursuing this line 
of reasoning, assuming the dramatic 
50% speed improvement comes from 
running one program, the Law shows 
that with an infinite number of proces- 
sors the best one could hope for would 
be a 3x performance boost (excepting 
the special case of intrinsically parallel 
programs). 

Then there's the bus bottleneck. 

Each of the twins in a dual-core 
SMP chip has its own zero wait-state 
cache, which feeds instructions and 
data at sizzUng rates to the CPU. But 
once off LI, they share an L2, which 
though fast, stalls every access with a 
couple of wait states. Outside of the 
L2, a single bus serves two insanely 
high-speed processors that have raven- 
ous appetites for memory cycles, cycles 
slowed by so many wait states as to 
make the processor clock rate for ofif- 
chip activity almost irrelevant. 

And here's the irony: a multi-giga- 
hertz CPU that can address hoards of 
gigabytes of memory, that has tens of 
millions of transistors dedicated to 
speeding up operations, runs mind- 



numbingly fast only as long as it ex- 
ecutes out of LI, which is typically a 
microscopic 32 to 64 kbytes. PIC-sized. 
Run a bigger program or one that uses 
lots of data, and the wait-state logic 
jumps on the brakes 

A couple of Z80s might do almost 
as well. 

In the embedded world, we have 
more control of our execution environ- 
ment and the program itself than in a 
PC. Some of the RTOS vendors have 
come up with clever ways to exploit 
multicore more efficientiy, such as pin- 
ning tasks to particular cores. I have seen 
a couple of dramatic successes with this 
approach. 

If a task fits entirely within LI, or 
even spills over to L2, expect tremen- 
dous performance boosts. But it still sort 
of hurts ones head - and pocketbook 
- to constrain such a high-end CPU to 
such small hunks of nmagecy. 

Any program that runs on and off 
cache may suffer from determinism 
problems. What does "real time" mean 
when a cache miss pitolongs execution 
time by perhaps an order of magnitude 
or more? Again, your mileage may vary 
as this is an extremely appleation-de- 
pendent issue, but proving a real-time 
system runs deterministically is hard at 
best Cache, pipelines, ^jeCnlative execu- 
tion, and now two CPUs competing for 
the same slow bus all greatly complicate 
the issue. By definition, a hard real-time 
system that misses a deadline is as bro- 
ken as one that has completely defective 
code. 

Multicore does address a very 
important problem, that of power 
consumption. Some vendors stress 
that their products are more about 
MlPs/watt than raw horsepower. Cut 
the CPU ebdc a bit, double the number 
of processors, and the total power needs 
drop dramatically. With high-end CPUs 
sucking 100 watte or more (at just over 
a volt; do the math and consider how 
close that is to the amps needed to start 
a car), power is a huge concern, particu- 
larly in embedded systems. Most of the 
SMP approaches that I've seen, though, 
still demand tens of watts, far too much 
for many classes of embedded systems. 



One wonders if a multicore 
proach using multiple 386s stripped of 
most of their fancy addressing capability 
and other bus management featiu-es, 
supported by lots of "cache," or at least 
fast on-board RAM, wouldn't offer a bet- 
ter MlPS/watt/price match, at least in the 
embedded space where gigantic applica- 
tions are relatively rare. 

Finally, the holy grail of SMP for 
30 years has been an auto-paraUelizing 
compiler, something that can take a 
sequential problem and divide it among 
many cores. Progress has been made, 
and much work continues. But it's still 
a largely unsolved problem that is be- 
ing addressed in the embedded world 
at the operating-system level. QNX, 
Green Hills, and others have some very 
cool tools that partition tasks both stati- 
cally and dynamically among cores. But 
expect new sorts of complex problems 
that make programming a multicore 
system challenging at best. 

HERETO STAY 

Although this rant may be seen by 

some to be completely dismissive of 
multicore, that's not the case at all; 
my aim is to shine a little light into 
the marketing FUD that permeates 
multicore, as it does with the introduc- 
tion of any new technology. Mtilticore 
processors are here to stay and do offer 
some important benefits. You may find 
some impressive performance gains 
by employing SMP, depending upon 
your specific apphcation. Do see David 
Kleidermacher's article about it at www. 
embedded.com/design/205203908. 

Most of my complaints disappear 
when each core runs code from its own 
memory space. Tensilica has some cus- 
tomers getting astonishing performance 
gains using this approach. 

Picochip's results are impressive as 
well (see www.insidedsp.com/Articles/ 
tabid/64/articleType/ArticleView/ar- 
ticleId/228/Default.aspx for a recent 
benchmark). But that's not SMP and is 
a completely different diseussion. 

jack G. Ganssle (jack@ganssle.conn) is 
a lecturer and consultcmi- dt] embedded 
development issues. 
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Intel adds processors, extends life 



Intel has expanded it range of devices 
for embedded market segments with 
processors with extended, 7-year Hfe 
cycle support, a new chipset and a 
carrier-grade server. The processors, 
based on Intel's high-k, metal gate 
transistor formula and manufactured 
on the company's 45-nanometer (nm) 
process, include the Quad-Core Intel 
Xeon processor 5400 Series and Dual- 
Core Intel Xeon processor 5200 series. 

These new processors, coupled with 
the new power-optimized Intel 5100 
memory controller hub (MCH) chip- 
set, comprise the first 45rmi CPU plat- 
forms for thermally constrained bladed 
applications. When using the Intel 
5000P chipset, the 45nm processors 
are suitable for fiill-performance and 
memory-intensive applications such as 
Storage, routers, security and medical 
solutions, as weU as communications 
applications such as IP multimedia sub- 
systems (IMS). 

The processors take advantage of 
Intel's Hafnium-based, high-k metal 



gate transistor formula, which reduces 
power consumption, increases switch- 
ing speed and significantly increases 
transistor density over the company's 
previous 65nm manufacturing technol- 
ogy. These 45nm CPU-based platforms, 
based on the Intel 5100 MCH chipset, 
are suitable for 200 watt maximum 
power envelope specifications such 
as AdvancedTCA and also for NEBS 
Level-3 requirements. 

Intel says it recorded a 67 percent 
increase it compute performance-per- 
watt when it validated the Intel 5100 
MCH chipset-based 45nm quad-core 
platform. These comparisons were 
done using 2x Quad-Core Intel Xeon 
processor L53 1 8 on an Intel 5000P 
chipset-based platform versus 2x Quad- 
Core Intel Xeon processor L5408 on an 
Intel 5100 chipset-based platform using 
the same workloads . It says that the 
45nm quad-core processors also allow 
for a 22 percent performance gain over 
previous-generatioii pktfon^ within 
the same thermal profile. 



Intel is offering extended Ufecyde 
support for 7 years for the dual-core In- 
tel Xeon processor 5200 series (E5240, 
E5220, L5238) and the Quad-Core Intel 
Xeon processor 5400 series (E5440 and 
L5408). This represents an expansion 
from previous minimum support of 5 
years. 

The new Intel carrier grade server 
TIGH2U building block offers in- 
creased choice for customers that re- 
quire power efficiency and improved 
compute performance for high-end 
communication applications. Intel 
also armounced enhancements for its 
TIGWIU carrier grade server, NSWIU 
IP network server and NSC2U IP net- 
work server. These communication 
rack-mount servers now support the 
Quad-Core Xeon processoi- 54()0 scries 
and are suitable for telco and network 
applications in harsh enviroiunents 
with NEBS Level-3 requirements that 
demand high performance, energy ef- 
ficiency and high I/O throughput. 
www.inteLcom 



Memory controller improves power and reliability 



Hyperstone has introduced the F3 
family of Flash Memory controllers for 
solid state disk (SSD), disk-on-module 
(DoM), CompactFlash (CP) cards, 
and embedded flash. The F3 provides 
lower power consumption, higher reli- 
ability, endurance, and rigorous power 
fail safe features for aU single level 
cell (SLC) and multi level cell (MLC) 
based flash memory solutions. 

Based on a 32-bit RISC core includ- 
ing instruction set extensions opti- 
mized for flash handling, the F3 offers 
safe power fail handling, proven error 
detection and correction, superior wear 
leveling that involves all physical blocks 
including the ones containing static 



data, satisfying the most demanding 
requirements regarding data traffic and 
power fail situations. 

It is fully compliant to Com- 
pactFlash 3.0 and compatible to 4.1 
specifications and has a fast ATA sup- 
porting PIO mode 6, MDMA mode 
4, UDMA mode 4 in True-IDE mode 
with sustained read tq> to 45 Mbit/s 
and random read up to 35 Mbit/s and 
sustained write up to 30 Mbit/s and 
random write up to 6 Mbit/s. 

It provides two direct flash access 
(DFA) channels including two sector 
buffers and interleaving capability as 
well as error correcting code (ECC) 
capable of correcting 4 symbols in a 512 



bytes sector with additional CRC. 

As firmware is stored in flash mem- 
ory, all future flashes can be supported 
by simple firmware upgrades while a 
built-in voltage regulator, and detector, 
reducing the bill-of-materials to a few 
additional capacitors and resistors. 

System operation current is 75 
mA max. depending on flash, auto- 
matic power-down mode, power saving 
mode, automatic wake-up with sleep 
mode current < 200 ^A. 

The F3 is available as known-good- 
die, in a 100 or 128 pin TQFP with and 
in either industrial or eonstmier tem- 
perature range. 
www.hyperstone.com 
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Navigation module extends robust CPU cards 



With the introduction of MSG Ver- 
triebs GmbH's EXM32 navigation and 
communication module the compa- 
ny's CPU modules EXM32-AU1250 
or EXM-SH7760 are enhanced with 
quad band 800/900/1800/1900 MHz 
EGSM and GPRS as well as with GPS 
functions. UMTS/HSDPA/EDGE is 
supported optionally. An external gyro 
sensor module can be connected for 
inertial navigation when satellite con- 
tact is not available. 

Thanks to a special connec 
tor technology, the compact 
EXM-NAVCOM module, 
with dimensions of only 65 
mm X 90 mm, can be eas- 
ily mounted on the CPU 
board. Therefore, powerful 
navigation, telematics and 
fleet management systems 
can be implemented with 
very little effort. 

The module is designed for 
an extended operating temperature 
range from -30 to +S5°C. 

In combination with a suitable 
GPS/GSM dual antenna, the 16 channel 
GPS receiver on the EXM-NAVCOM 
module provides the current position 
with an accuracy of 2.5 meters (CEP). 
In steady state condition the calcula- 
tion of a new position takes less than 1 



second, while with good signal strength 
the "cold start" takes less than 34 sec- 
onds. A reliable voice communication 
is possible via the implemented 12S 
audio interface. An integrated socket is 
provided for a SIM card. 

The EXM32-AU1250 CPU module 
from MSG Vertriebs GmbH, designed 
especially for harsh industrial environ 



ment, offers good processing perfor- 
mance and multimedia features with 
low power consumption. 

The device complies with the 
EXM32 system-on-module (SOM) 
standard, developed by MSG. The 




EXM32-AU1250 is based on the RMI 
Alchemy Aul250 RISC-CPU with 16 
kbyte data and 16 kbyte instruction 
cache. It is available in two versions. In 
the commercial variant, specified for an 
ambient temperature range from to 
70 °C, the processor operates at a clock 
frequency of 600MHz. The variant for 
industrial ambient temperature range 
from -40 to -l-85°C operates at 500MHz. 
The powerful CPU module is attached 
to the carrier board using stable elas- 
tomeric connectors. 

For applications where 
computing power and float- 
ing-point support are more 
important than multimedia 
features, the EXM-SH7760, 
based on Renesas' SH7760 
RISC CPU, is available from 
the same module family. 
For evaluation and design- 
in of the EXM32 modules, 
MSG Vertriebs GmbH provides 
special development platforms and 
starter kits. 

The mamifecturer offers support 
for developing custom-specific carrier 
boards. In addition to EXM32 mod- 
ules, the appropriate antenna kit and 
compatible gyro sensor module are also 
available from MSG. 
www.insc-ge.eom 



RoweBots upgrades Linux and POSIX compatible RTOS 



RoweBots Research Inc. has launched 

Unison Version 4 for the 32 bit PIC32 
micro-controller (MCU) family from 
Microchip Technology. 

Unison is equivalent to a tiny em- 
bedded Linux RTOS which provides 
complete indenmification. Unison pro- 
vides a very small open source POSIX 
RTOS which aims at increasing System 
on Chip (SoC) embedded development 
productivity and reliability. 

The Unison Version 4 completes 
the line of offerings for all Microchips 
processors from Microchip's PIC24 
16 bit MCUs through the dsPIC 30/33 



DSCs to the PIC32 MCUs. Unison 

and DSPnano offer identical features 
and seamless support with including: 
integrated SoC DSP RTOS with full 
POSIX capabilities and a tiny foot print 
to minimize training time and proces- 
sor size, DSP libraries widi 150 fimc- 
tions for off the shelf tried and proven 
processing, complete I/O minimizing 
development and integration, free de- 
velopment, seamless integration with 
Microchip's MPLAB IDE for C instruc- 
tion level simulation, compiling and 
debugging, and seamless migration be- 
tween products without code changes. 



Unison V4 is hosted on Windows 
XP and Vista, for x86 platforms, and 
will begin shipping immediately. 

V4 offers seamless upgrading from 

DSPnano running on PIC24 and dsPIC 
30/33 to PIC32 without application 
changes. 

The Freescale Coldfire V2 processor 
is also another SoC solution that is be- 
ing supported. Like the other DSPnano 
chip support packages, Unison chip 
support will include a fuU complement 
of peripherals to eliminate low level 
programming. 
www.rowebots.com 
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ST introduces 8-bit microcontroller platform 
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STMicroelectronics 
has released details of 
a 8-bit microcontroller 
platform. Combin- 
ing an 8-bit core and a 
set of peripherals, the 
STM8 platform will be 
manufactured using an 
ST-proprietary ISOnm 
embedded non-volatile 
memory technology. 

The STM8 platform 
will be the foundation 
for a number of distinct 
product famUies each 
optimized for specific 
application areas, including automo- 
tive, industrial, low voltage and battery- 
operated applications, as well as appU- 
cation-specific standard products. 

The STM8 leverages a Harvard 
architecture with 16-bit index registers 
and stack pointer, a 16Mbyte linear 
address space, advanced addressing 
modes and other features designed to 
optimally support C-programming to 
deliver leading-edge CPU performance 
in both speed and code density. The 
core reaches an average of 1.6 cycles 
per instruction with 20 MIPS of peak 
performance at 24MHz using a 3-stage 
pipeline. 

The STM8 platform offers real em- 
bedded EEPROM with a performance 
in endurance and retention comparable 
to those of external components, mak- 




ing complex flash-based emulation 
strategies obsolete. The available on- 
chip flash program memory sizes will 
range up to 256K. 

The STM8 peripherals are homo- 
geneous across ST's 32-bit |aC families 
such as the STM32 range, making the 
STM8 a perfect complement to ST's 32- 
bit MCU portfolio. ST will introduce 
the first STM8 famUy in the first half of 
2008. 

■ STMicroelectronics has made the 
NicheLit TCP/IP stack available free 
of charge with its STR91x 32-bit flash 
microcontrollers for networking ap- 
plications. ST and InterNiche have 
tailored the standard stack for optimal 
performance with the STR91x. The 
stack capabiWes can be extended with 



add-on modules already 
available from Inter- 
Niche. 

As a fully-featured 
TCP/IP stack with a 
memory footprint of 
less than 12 kbytes, 
NicheLite includes a 
compact TCP layer 
optimized for simple 
memory management, 
data copy prevention, 
low memory usage and 
high data throughput. 
The NicheLite IP layer 
supports one hardware 
interface and is compatible with Inter- 
net, email and network-management 
protocols also available fi-om Inter- 
Niche. 

These include NicheStack PPP, 
FTP, Telnet server, HTTP server, 
DHCP server, email, and SNMP. Other 
protocols supported include address 
resolution protocol, internet control 
message protocol, user datagram proto- 
col and client bootstrap protocol. 

Customers developing applications 
for the STR91x family can implement 
NicheLite at no charge, with the option 
to purchase support from InterNiche 
as required. Included with the stack is 
the NicheTool debugging and tuning 
software suite, which provides powerfiil 
porting and optimization capabilities. 
■www.stcom 



Green Hills to support OMAP35x and DaVind 



Green Hills Software, has announced 
plans to support the Texas Instru- 
ments (TI) OMAP35x superscalar ap- 
plications processors. The Green Hills 
solution for the OMAP3503 consists 
of MULTI integrated development en- 
vironment; compilers that claim 15% 
better code performance and density 
than ARM compilers; DoubleCheck 
static code analyzer; TimeMachine 
debugger, which enables forward- 



and-reverse code execution; and a fast 
simulator. 

The Green Hills tools are notable 
for their OS-aware development and 
debugging. The tools have built-in sup- 
port for Green Hill's OSes, as well as for 
Linux. Green Hills also provides exten- 
sive multi-core development support, a 
factor that will valuable as TI rolls out 
DSP-l-ARM OMAP processors. 

Green Hills is also supporting the 



Texas Instruments (TI) DM355 DaVin- 
ci digital media processor. Green HiUs 
Software's family of operating systems, 
optimizing C/C++ compilers, advanced 
debugger, MULTI IDE, and Double- 
Check are available today for DM355. 
TimeMachine is available today for the 
integrated ARM instruction set simula- 
tor and supports the embedded trace 
buffer on the DM355. 
www.ghs.com 
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Autosar development kit 



Geensys SA (Boulogne-Billancourt, 
France) has launched its first in a 
planned series of Autosar develop- 
ment kits at a price of €25,000. It 
comes with five days onsite support 
and one year's maintenance. The kit 
is said to enable automotive engineers 
to learn about and practice Autosar 
software development in an easy and 



flexible way- Geensys' first Autosar 
development kit combines a hard- 
ware evaluation board, v«th Freescale 
S12X microcontroller, basic software 
modules compliant with Autosar 2.1 
and the complete Autosar Builder tool 
chain It also includes CANbus and 
NVRAM demonstration examples. 
www.geensys.com 



Development 
tools offered for 
Stellaris MCUs 

Code Red Technologies Ltd. (Cam- 
bridge, England) a provider of soft- 
ware development tools for 32-bit 
microcontrollers has ported its Code 
Suite development tools to the Stel- 
laris microcontroller range from Lu- 
minary Micro (Austin, Texas). 

The two companies also an- 
nounced the availability of SteUaris 
evaluation kits featuring an evaluation 
version of Code Suite development 
tools which is always unrestricted 
when used with the evaluation board. 

Code Suite is bult on top of the 
EcUpse integrated development envi- 
ronment with enhancements added by 
Code Red. Code Suite also works with 
real-time operating systems for designs 
where the resources of a real-time ker- 
nel are required, and includes a FreeR- 
TOS.orgT project wizard. 

Code Suite incorporates Code 
Trace, a software feature that capital- 
izes on the serial wire viewing (SWV) 
technology in Stellaris microcon- 
trollers to deliver unprecedented 
visibility during debug. Code Trace 
delivers profiling, interrupt statistics, 
interrupt tracing, RTOS event tracing, 
and a host-strings viewing mechanism 
which provides printf style debugging 
and losing without intrusiveness. 
Interrupt tracing provides graphical 
visualization of what is occurring in a 
real time system. 

Evaluation kits are available from 
Luminary Micro and Code Red Tech- 
nologies that incorporate Code Suite, 
and allow the use of the fiill unre- 
stricted use of the development envi- 
ronment. These kits, which are priced 
from $49 to $89, are available for a 
number of different configurations. A 
full license to Code Suite development 
tools is available through an online 
upgrade of the evaluation kits from the 
Code Red Technologies website. 
www.code-red-tech.com/ 



Run Time Environment (RTE) 




Real-time development support 



Infineon Technologies AG (Munich, 
Germany) has announced a develop- 
ment support kit with a CANopen 
development environment for its 16- 
bit XE166 family of real-time signal 
controllers for about $75 

The UConnect-CAN development 
kit enables the design of networked 
real-time motor control systems for 
industrial applications, such as servo- 
drives, air-conditioning systems, step- 
per motors and pumps. The Uconnect 
kit allows analysis of the CANopen 
protocol and application code for mo- 
tor control designs. The kit is supplied 
on a USB memory stick. 

The UConnect USB stick provides 
evaluation capabilities for the 16-bit 



XE164 microcontroller. The kit also 
contains an extension board for ex- 
panding the possible interfaces such as 
synchronous and asynchronous serial 
interfaces, six PWM channels, four AD 
channels and a second CAN interface. 

Also included in the kit is the 
CANopen protocol stack from Port 
GmbH, a one-year Ucense-free com- 
piler from Altium as well as a software 
configuration tool, along with all docu- 
mentation. 

the development kit UConnect- 
CAN is due to become available in 
the second quarter of 2008 and can be 
ordered from Infineon sales offices and 
distributors for 49 euros (about $75). 
vmw.infineon.com 
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ColdFire® Architecture. The perfect 32-bit compilation. 
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The ColdFire Architecture has a 15-year heritage steeped in innovation 
With more than 100 individual devices, ColdFire has one of the broadest 
32-bit portfolios in the industry. Whether it's USB, Ethernet, CAN 
or ZigBeef ColdFire connects and controls millions of networked 
products in homes, offices and factories around the world. ColdFire 
also enables your design to keep pace with today's eco-trends 
thanks to its industry-leading, ultra-low-power 32-bit MCU. And to 
make its own ecosystem even stronger and more vibrant, we are now 
openly licensing ColdFire VI . Find out what ColdFire can do for you. 




MeetColdFireatfreescale.com/coldfire101 ftGGSCSlG 
and save 50% on the QE128 Development Kit* semiconductor 

• While supplies last, 
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